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ANALYSIS OF TRACER EXPERIMENTS: II. HOMEOSTATIC 
MECHANISMS OF FLUID FLOW SYSTEMS* 


H. E. HART 
DEPARTMENT OF PHYSICS, COLLEGE OF THE City OF NEW YORK AND 
DIVISION OF NEOPLASTIC DISEASES, MONTEFIORE HOSPITAL, 
NEw YORK, N. Y. 


It is shown that, for an idealized steady-state system of compart- 
ments in which time lags, diffusion and fatigue effects can be neglected, 
modified tracer methods can be used to obtain an approximate description 
of the characterizing non-linear differential equations of the system. 


Introduction. In recent papers by L. Danziger and G. L. Elmer- 
green (1956), H. E. Hart (1955, 1957), and others, it is suggested 
that tracer methods be used in the study of homeostatic mecha- 
nisms. It is shown in Section I that perturbation and tracer meth- 
ods can in fact be combined to permit examination of the regulatory 
properties of a multi-compartment system. In Section Il, special 
cases are discussed. The experimental demands made by the 
analytical procedures here developed are not generally attainable 
in biological investigations at present although it is possible that 
certain simple systems can be treated. 


Table of Notation 


xX igh”) = concentration of stable (non-tracer) material in compart- 
ment 2 during the gth study. 
X.. = concentration of stable (non-tracer) material in compartment 


zat time L. 
r,. =rate of fluid flow from the jth to the ith compartment. 


V, = volume of compartment 7. 


*Research supported by the Atomic Energy Commission Contract 
AT (30-1)-1551. 
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Gj q(t) = wee = transfer coefficient during the qth study. 


= constant coefficients in the expansions: 


B By og 
» NF. wo XtqXbq°° XN gi 


B 


by O(Bicnens CO 


—0o<, B..-WK<o~. 


@,(t) = any complete orthogonal set of functions preferably pos- 
sessing the group property 4,9, = $, 


D sigs = Constant coefficients in the expansions: 


E = constant coefficients in the expansions: 
OB ai ges 


Agito Ae = oy ee eee pC 


I. General Methods 


If a steady-state, fluid flow system is perturbed by the sudden 
introduction or removal of a non-negligible amount of material, the 
transfer coefficients and compartment concentrations will vary 
during the transient period of re-equilibration in accordance with 
the equations: 


N 
Right). aiyghjg = 8 HE ses ae (1) 

jintt 
Making use of the appropriate assay methods to determine X * 
and of the non-steady state tracer methods (Hart, 1957) to calcu- 
late Qi; q(t)s it is seen that all quantities in equations (1) can be 
found. If the experiment is repeated with a different perturbation, 
some of the resulting quantities a; @(*) and X, q’ (e)s (¢#q°), will 
be different although they also obey ‘equations (1). If there are no 
time lags, if diffusion and fatigue effects can be neglected, and if 
all other conditions can be kept constant, then it seems reasonable 
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to assume that the a,,_ will be differentiable functions of the X, . 
The problem of finding the characterizing equations of homeostasis 
therefore reduces to determining the coefficients in Taylor expan- 
sions of N variables such that equations (2) below will hold for 
all perturbations (i.e., all q): 


Giese Es aa oun (2) 
a,8...W=e—oo 


An equivalent expansion 


tijq ~ De F ijiap. he ie OP a 0 pee VN wee Tepe Aaa (3) 


where X,, is the steady-state value of the concentration in the zth 
compartment, might prove of value in some cases. Furthermore, as 
pointed out by A. Rappoport (1957) if the rank of the a;,; matrix is 
less than N, then steady-state conditions can occur with identi- 
cally constant transfer coefficients. 

Expanding @,;,, and X,, in terms of ¢,, substituting in equations 
(2) and grouping terms in ¢,: 


co 


 otas Bs >” Bijoh...cHaB..>Weaes? (4) 
O,B...-W=—0o 
where D iias and Ar are constant coefficients that in 


principle can be determined directly from the experimental data, 

If a,, are expandable in a finite number of terms of the form 
aexe pry Seg gag deh and X,, can be approximated by a finite 
number of terms ingd,, and if a sufficient number of independent 
stimulus-response patterns can be observed (by changing the 
perturbations), then equations (4) can be solved for Bj 548... 
The accuracy of the determinations will in general be satisfactory 
only if the expansions in equations (2) are simple, containing 
small exponents only and few terms. It has been pointed out 
that in the latter event it may also be possible in many cases to 
carry out the analysis completely using only a single tracer. If 
simultaneous determinations of all variables are possible, the ex- 
pansions in#, can be dispensed with, and the methods of Landahl 
(1954) can be used to determine @,,, and Biiag...e evaluated from 
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equations (5) below: 
C54 Ohm ye fg were oe oe Xe (5) 


OPENS ete 


Whenever experimentally feasible, the latter ‘‘direct method”? is 
probably better suited for practical determination of the coeffi- 
cients B ict Bove than the expansion method since small random 
errors can be aged by forcing the data to best fit a given set of 
terms (depending upon the particular model suggested by any sup- 
plementary knowledge) in the expansions of equations (2). What 
ever method is employed, after determining the coefficients 
Ba ict Bic. and substituting equations (2) in equations (1), it follows 
that: 


€+1 
> Bitte ee ee 1+eXy=0. (6) 


Using determinations of stable molecules only, lumped coeffi- 
cients of the terms Xf, Xf x8 etc., in equations (6) can be deter- 
mined independently from equations (6) either by the expansion or 
the direct substitution method. Since the separate similar terms 
arising from the substitution of equations (2) in equations (1) will 
already have been combined in the expressions obtained for equa- 
tions (6) using only stable molecules, it follows that, although 
equations (2) (obtained by tracer methods) uniquely define equa- 
tions (6) in fluid flow systems, the converse is not true in general. 
The partial effectiveness of non-tracer determinations is perhaps 
related to the observation of C. W. Sheppard and A, S. Householder 
(1951) that, using their method, stable material acts as a label for 
itself in the sense that when dealing with a physiologically occur- 
ring substance only N-1 radioisotopes are required to completely 
analyze an N compartment system. Stated another way, the limited 
amount of detailed information contained in equations (6), after 
similar terms in rexe etc,, have been combined, would seem to 
reflect the fact that measurements taken following a single injec- 
tion of an isotope, stable or otherwise, are inadequate to com- 
pletely solve a general N compartment, non-steady state system. 

It should be noted that the coefficients B,, O Bes. €...w IN Equa- 
tions (6) may actually depend upon various additional external or 
internal factors such as temperature, pressure, pH., or even vari- 
ous individually detectable but not fully characterized chemical 
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substances for which tracers might not be possible or convenient. 
So long as these factors remain constant they need not appear ex- 
plicitly in the analysis. 

As indicated above, a given coefficient in equations (6) may in- 
volve contributions from several separate transfer processes and 
the system under investigation will not in general be completely 
specified, The exact nature of the results obtainable using pure 
tracer, pure perturbation and combined methods of studying general 
multi-variable systems would seem worthy of further investigation. 


II. “Special Systems. 


In attempting to develop or solve the equations above, it is 
necessary that sufficient experimental data be obtained. Although 
complete characterization of simple systems might often prove 
possible using a single perturbation of any variable, it can be 
shown that for certain systems a perturbation of one component 
will provide little or no information about some of the others. As 
a trivial example consider a compartment in any unidirectional 
flow, non-feedback system such as occurs in radioactive decay. 
Clearly a perturbation in the amount of one component present can 
give no information about rates of decay higher up in the series. 
Even when feedback exists, a certain amount of difficulty can 
arise. Consider the model of Figure 1 in which the compartment 


Go 


I . 
feedback signal x f5(X3) feedback signal 


a be 


Qe 


a,=9 [te ( X2),f5(X5)] 


FIGURE 1 
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volumes are taken to be unit volumes. The equations correspond- 
ing to Figure 1 when no tracer is used are as follows: 


X,+4,X,+4,X,-a =0, 
X, ~a,X,+a4,X, =0, (7) 
X, —@,X, +0,X = 0. 


Although no fluid flow as such occurs from (2) to (1) or from (3) 
to’ (1);\/5 (4,) and 7, (*3) represent signals which together in any 
arbitrary way can regulate the transfer function a,. If X, =X, 
initially, no perturbation of X, alone can serve to determine f, 
(X,) and f, (X,) separately. 

Leaving aside special examples, it seems rather likely that com- 
plex biological systems will usually involve many separate feed- 
back and flow loops and that a simple perturbation introduced at 
one point of a lengthy cycle will be smoothed out or damped before 
it effects many steps. Determining the minimum number of per- 
turbations required for a full investigation of a given system (i.e., 
a ‘‘complete’’ set of perturbations) might prove to be an interesting 
problem. 

If the X , and their products of interaction act as enzymatic regu- 
lators of the transport process, then functional dependencies can 
arise of the form: 


Dias pk ee (8) 
ij Lt P(A Ag ees oO) 


where P is any polynomial. The right side of equations (2) may 
then be divergent infinite series even though in the range of con- 
centrations considered the a,, are well behaved. It might be of in- 
terest to see if the methods outlined can be adapted to treat such 
cases as well, 


It is a pleasure to thank Dr, Daniel Laszlo for his stimulating 
encouragement and support of this work. I should like to thank 
Professor Herbert Landahl for many excellent suggestions and 
comments which were very useful in clarifying the relations be- 
tween equations (4) and (5). I should also like to thank Professor 
Robert Hatcher and Dr. Mones Berman for their careful review of 
the theory and for many helpful suggestions, 
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ON AGE DISTRIBUTION IN POPULATION GROWTH 


ALLADI RAMAKRISHNAN AND S. K. SRINIVASAN 
INSTITUTE FOR ADVANCED STuDY, PRINCETON, NEW JERSEY; 
AND DEPARTMENT OF PHYSICS, UNIVERSITY OF SYDNEY, 
SYDNEY, AUSTRALIA* 


The statistical problem of age distribution in population growth in- 
volves the well known difficulties in mathematical probability of defin- 
ing the distribution of a discrete number of randem points (individuals) 
in a continuous parametric space (age). The assumption of the possi- 
bility of multiple births makes the problem more complicated, and we are 
constrained to introduce the concept of multiple points. This leads to an 
extension of the method of product densities devised earlier for the treat- 
ment of population problems. The paper deals with such an extension 
and as an example of the application of this method the population prob- 
lem with twins is discussed. 


1. Introduction. The statistical problem of the age distribution 
in population growth raises the well-known difficulties in mathe- 
matical probability of defining the distribution of a discrete number 
of random ‘‘points’’ (individuals) in a continuous parameter space 
(age). Similar problems arose in the cascade theory of cosmic ray 
showers, and it is not surprising that a common technique has been 
devised by various authors (Kendall, 1949; Ramakrishnan, 1950; 
Bhabha, 1950). This technique consists in defining densities of 
various orders (known as product densities) in the continuous space 
and relating the moments of the number of points lying in a finite 
‘interval’? or ‘‘volume’’ in the space, with these densities. A 
comprehensive summary of this method is given in M. 8. Bartlett’s 
book on stochastic processes (1955). 

The basic assumption of the above-mentioned method is that the 
probability that n random points lie in an infinitesimal interval (or 
volume) dt is O(dt)” and hence only the probability that one ran- 
dom point lies in an infinitesimal interval contributes to the mo- 


*Present address of both authors is the Department of Physics, Uni- 
versity of Madras, Madras, India. This paper was written while on leave 
of absence from there. 
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ments. On this assumption the problem of population growth with 
“‘twins’? and ‘‘multiplets’’ cannot be dealt with since such ‘‘mul- 
tiplets’’ will have the same age. Thus if we wish to include such 
‘multiplets’? we have to extend the formulation of product densi- 
ties to include ‘‘multiple’’ points. Though this extension may not 
be so important in human populations, it is hoped that in other 
genetical and multiplicative stochastic processes it may find ap- 
plication. Hence we first formulate the extension in a general 
manner and then apply it to the particular case of population growth 
including ‘‘twins.’’ Neglecting ‘‘twins’’ the discussion, of course, 
reduces to that of D. G. Kendall (1949). 


2. Extension of the Product Density Technique. Let M(z) be the 
stochastic variable representing the number of random points with 
parametric values less than x (2 for example may be the age of an 
individual of the population or the energy of a cosmic ray particle). 
Then dM(ax) is the stochastic variable representing the number of 
random points in the interval between z and z + dz. We shall as- 
sume that in the elementary range dz the maximum number of ran- 
dom points which can le with probability O(dz) is n so that the 
probability of occurrence of n+m(m21) is of order O(dz)/, 
2<j78n+m. If & points occur together in dz with probability 
O (dz) we shall call them collectively a k-tuple point. We decom- 
pose the stochastic variable dM (z) as 


dM (x) = dM (2) + 2dM@ (a) +... + dt (2), (1) 


where dM‘(z) is the stochastic variable representing the number 
of i-tuples in the range dz. For defining product densities we treat 
the multiple points as different types or ‘‘species’’ of points and 
specify the densities for each ‘‘species.’’ Thus we can define 
f(a), f(a), ..., fF? (@) as the densities of the first order of 
single, double, ..., and n-tuple points respectively. From the 
theory of product densities we know that 


E (dM (w)} = f(a) da, (2) 


where E denotes expectation value. Therefore 
E {dM (a)? = a nf (2x) de , 


E{M (2)} = in Yn (2) de. 
0 n 
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Higher-order product densities are introduced by considering the 
expectation value E {dM (a,)dM(x,)... dM(a,)}. In particular 


E {dM (z,)dM(x,)} = 


E ye dM(z) 2a | = 
i= j= (4) 
2S iff? (w,, «,)de, de,» 


i=l j=l 


where feP:, z,) dz,dz, is the joint probability that an 7-tuple 

lies in dz, and a j-typle in dz,. ee is a mixed product density of 

degree two of i-tuples and j-tuples. Thus we have n® different 
product densities of degree two and in general n™ different product 

densities of degree m. 

Hence the extension of the theory to multiple points consists in 
considering the multiple points of different orders as different types 
of points and viewing the product densities accordingly. Instead 
of dealing with dM (zx) we study the individual dM (x) of which it 
is composed. In doing so we must emphasize the distinction be- 
tween a k&-tuple which occurs in dz with probability O(dzr) and & 
points which consist of two or more multiples, say, one z-tuple and 
j-tuple such that 7 + j = k, the probability of the joint occurrence of 
which in dz is O(dz)*. But as regards the development of the 
process and the enumeration of the points, the ‘‘members”’ of the 
multiplets are treated just the same as the “‘singlets.”’ 

We do not give here the formal expression for the general rth mo- 
ment of M(z) in terms of tHe product densities of the multiple points 
since it is too elaborate to be of any use but content ourselves 
with the method of arriving at the expression: 

1. First expand [M(#)]’ in terms of the dM“ (a,) using equation 
(1). Thus M(z) is a sum of a series of products of terms dM (a;). 
2. Replace every power of any dM“(2;) by dM“(a;). This is 

due to the degeneracy 


E {lam (a yk} —> E fdM? («,)} > Ae ydew. (5) 
3. Recognize that 


Blan (,)dM(2,) ... d''(@))} > 
fftrrtare D(x, ae eae w)) dx, dz, Bb dz), 


(6) 


9? 
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where aia cats ID (a, @,, +++, 2) is the mixed product density of 
degree 7 such that the term (6) represents the probability that one 
i,-tuple lies in dz,, one z,-tuple lies in dt,,..-, and one z,-tuple 
lies in dz,. 

4. Replace the summations over the z,’s by integrals over the 
range defined by M (a), i.e., 0 to z in this case. 

In the simple case when no multiple points are allowed we get 
the expression obtained in the original formulation of product 
densities 


E f{[M(2)1"3 = 0 if. sf tu so 2) de. one 


where the C! are the coefficients determined by expanding N’ as 


N=)" CIN(W-1)...(N=8 +1), (8) 


s=1 


N being a positive integer. 

The coefficients of the integrals in the case when multiple points 
are included cannot be expressed elegantly but are uniquely de- 
termined by the rules stated above. 

For the particular case of the second moment when only single 
individuals and twins are allowed we have: 


E {[M (x)]?3 = [AP )ae, + 4 [1 @,) ax, + 
0 0 


x x x x 
2 | if ace w,)dx, dx, + a! | [eta v,)dx, da, . 
0 /o 0 Yo 


3. Formulation of the Population Problem With Twins. Kendall 
formulated the population problem in the following manner. We are 
given that there is a single individual of age a at time t= 0. The 
single individual and its subsequent descendants generate a popu- 
lation in accordance with the following rules: 

(i) The subpopulations generated by two coexisting individuals 
develop in complete independence of one another. 


(9) 
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(ii) An individual of age z existing at the time ¢ has a probability 
A, (x) dt + O(dt)? 


of producing a single individual of age zero between ¢ and ¢ + dt. 


(iii) An individual of age x existing at the time ¢ has a proba- 
bility of extinction or death 


u(x) dt + O(dt)?. 


(iv) The birth and death probabilities A, (v) and p(x) depend only 
on the age 2 of the individual and not on ¢, the time of their 
existence. 

In addition to these assumptions of Kendall, we introduce the 
following: 

(v) An individual of age 2 existing at the time ¢ has a probability 


A, (@) + O(dt)? 


of producing a ‘‘twin’’ of age zero between ¢ and ¢ + dt, this proc- 
ess being independent of (ii), and A, (x) being independent of ¢, 
(vi) The two individuals constituting twins behave individually 
after production. 
We wish to obtain the first two moments of the number of individ- 
uals above a certain age at any time ¢. We shall assume through- 
out that p is independent of age. 


4. Mean Behavior of the Population. In view of the assumption 
(v), we have a probability O(dz) of finding two individuals of the 
same age between z and z + da at any particular time. Hence we 
introduce f((«; ¢) and f(a; t) as the first-order product densities 
of single individuals and twins respectively; i.e., (a; t) dx is 
the probability that an individual has an age between z and x + dx 
at time ¢t, and f?(#;¢)dz is the probability that two individuals 
have the same age between « and z + dz at time ¢. 

We next proceed to obtain the equations satisfied by { (#2) 
and f@(2;t). Since we want an individual of age @ at time ¢, it 
must have been born at time (¢- 2). A single individual can be 
produced between (¢ — x) and (¢ -# + dx) by 

(i) either the primary individual; this occurs with probability 


e HUE“) (a+t-a)dz, (10) 


(ii) or a secondary which may be a single individual or one of 
twins; this occurs with probability 
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dx ma t-2)d,(u)du + 
0 
(11) 


2] (Pew ear, (9ae), 


The factor 2 occurs since any one of the twins may produce the re- 
quired individual. 

(In (i) and (ii) if we multiply the corresponding birth probabilities 
by the survival probability e-“*, we obtain the probability that a 
single individual produced under circumstance (i) or (ii) between 
t—a# and ¢—2x + dz survives up to ¢.) 

(iii) There is also another possibility for a single individual to 
exist in the age group (z, 2 + dz) at time ¢, namely, by the produc- 
tion of a twin under circumstance (i) or (ii) and the consequent 
death of any one of the twins, the other surviving up to ¢% The 
corresponding probability can be obtained by replacing 4, by A, in 
(i) and (ii) and multiplying by 2e **(1 -— e 4*) which is the proba- 
bility that one of the twins dies between ¢ — 2 and ¢ and the other 
survives up to ¢. Thus f‘"(a; ¢) satisfies the equation 


f(P (a, tl) =A, (a+t—ae 4? +20, (a+t-a)e #1 -— e #*) + 


t—x 
i f(a, as: z) la, (u) e7#* “ 2d, (u) (1 ~ e #*) e #*) du * (49) 


t—x 
2 | f(u, t - x) [A, (u)e#* + 2d, (u) (1 — ec #*) e#*] du. 
0 


Using the same arguments as for (12) we obtain 
f(z, t) = ro (a+t-2) ett x) 


[ t — 2), (u)e” 7#* du + (13) 


t—x 
2 | f(u, t- av) dr, (u) e 24#* du. 
0 


We can obtain explicit solutions of (12) and (13) when the birth 
rates are assumed to be age independent. Setting A, (@) = A, a con- 
stant and A, (z) = A, a constant, we observe that 


ff? (@, deM*(A, +20, -2A,6 #7)? 
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and 
f(x, t)e™#*A5! 
are functions only of ¢-— z. Accordingly we make the substitutions 
i> (a, @) =e P(A, +.2A, -2dA,€ **)¢, @- 2), (14) 
f(x, t) =e PA, 6, (¢- 2). (15) 
Then ¢, and ¢, satisfy the equations 
t 
$,(@)=1 +{ (Ag+ 2A, °- 2QaA,e **)d, (t-u)dut 
: (16) 
t 
2d. | d,(t-uje Bidu, 
) 


p, (t) = eh" +fema, +2, -2d,€¢ *")h, (6 -uan+ 
: (17) 


Dry [ sa(e- au. 
0 


From (16) and (17), it is easy to recognize 


oa (te = 4,(4)- (18) 


Substituting (18) in (16) and solving the resultant simple integral 
equation, we obtain 


b(t) = estas, (19) 
Thus 
foP (a, t) = eT PEA, + 2A, — 2AQe M*) e(Mt2 2-2), (20) 


(Pe, t)=X, grb te x)+ Ayt hart), (21) 


If we put A, = 0, f7 (A; 2) becomes zero and (2) reduces to the re- 
sult obtained by Kendall. If we set A, = 0, we obtain 
f(a, t) =e M2A,(1— 6 #*) pare 2)., (22) 


f(a, t) = bent TEES peed (23) 


Equation (22) shows that there is a positive contribution to age 
even though we set A, = 0. Of course this is to be expected since 


the twins behave independently. 
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Equations (2) and (21) provide any kind of information we re- 
quire regarding the mean behavior. For example, if we are in- 
terested in the mean number of singles or twins at ¢ with their ages 
lying in the interval (2,,2,), we have merely to integrate the corre- 
sponding f over the range (2,, Z.). 


5. Fluctuations in Age of the Population. If we want to study 
the fluctuations, we have to introduce the second-order product 
densities. We have now four different product densities of degree 
2: FO Dee), (Oe Dey @ 4 ee eee 
fS'') (a, y; t) dwdy is the time joint probability that there is an in- 
dividual of age between x and 2 + dz and an individual of age be- 
tween y and y + dy. We can similarly interpret the other functions. 

We now proceed to obtain integral equations satisfied by the 
product densities of degree 2. We shall assume x<y<t. As be- 
fore we shall derive integral equations when A’s are age-specific 
and obtain their solutions for constants 2’s. 


ASP (wy, 0) =f Oy — 2 t- a)e(y — a, we P* + 
2A? (y — 2, t-2)(1-e M*)e B*e(y — 2, 2) + 
K'(y-@,t-a)e4*e(a+t—2, 2) + 


2K*(y—2,t-ze *(1-e #*)e(a+t-—2, 2) + (24) 
t—x 
fe e* e(u, x) (fh? (u, y-—a@, t-2) + 
0 


OY etd Cs y-2@,t-a)+ 279° (u, y-2, t-a)(l1-—e 4*) + 
4f2-(u, y—a, t-2)(1—e7#)] du, 
where 
e(u, v) = [A, (wu) + 2A, (u)(1-e #*)] (25) 


and K'(é; n)d&[lor K*(é; n)dé] represents the probability that 
there is a single individual (or a twin) in the age group (€, € + dé) 
with the primary alive at n(n > &). Of course, we have to write 
integral equations satisfied by the auxiliary functions K' and K? 
and solve for them, Equation (24) is derived in the following manner. 

Since we have assumed @ < y <¢, at time ¢ — z, we must have at 
least a single individual of age between y — x and y — x + dy, and 
it is easy to see that every term on the right-hand side of (24) con- 
tains the probability of such an event. The individual that has an 
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age between z and z + da at ¢ must be produced at ¢ — a by 

(i) the other secondary of age y at ¢, i.e., y — @ at time ¢ — 2, 

(ii) or the primary, 

(iii) or any other secondary. 

The first two terms of the right-hand side of (24) correspond to 
(i) and the third and fourth terms correspond to (ii) while the last 
term corresponds to (iii). The function «(u, x) has a simple in- 
terpretation. ¢(u, z)dx denotes the probability of observing a sin- 
gle individual of age between 2 and « + da at time ¢ given that it 
was produced between ¢- 2+ dz and ¢t,¢-@ by an individual of 
age U. 

Using exactly the same arguments we obtain: 


fe (a, y, t) = f(y -2, t- z)e~?#*De(y — 2, &) + 

k*(y - 2, t-—a2)e *H* e(y—a, t - 2) + 

t—-x 26 
i e(u, ze 7H* [Ff (u, y — 2, 6-2) + oe 


0 
2f£(u,y—a2,¢- a) du; 
2D (a, y, t) = fP(y — 2, t- #)2Ay(y - ae *P* + 
K?(y—2,t-2)A,(a+t-a2)e *H* + 
t—x 27 
I [f5-?(u, y-@, t- 2) + oe 
0 


DF OE Car, y-@,t- a)] ro (uw) 21 7B* dies 


fo (2x, VE t) = Ley —@,t— x) + 
a(1 —e M*) f(y -—2, t- 2A, (y - 2) e oH* 4 
[K'(y-2#,¢-2)+2(1 ~e*)K?(y —2,t- a))A, (a + t - 2) e BHM +4 


t-x ee (28) 
[ [fLeP(u, y- 2, t- 2) + 2fy D(u,y — 2, t- 2) + 
0 


of<(u, ya, t-2)(1-e #*) + 
46% (u, y — 2, t- 2)(1— e HAYA, (uw) oO" 1d | 


Our next task is to obtain K *(a, ¢) and K* (2, t). From the defi- 
nition of the K’s and using elementary probability arguments that 
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are now familiar, we obtain 


K\ (a, t)=e He(a+t—2, 2) + 


[RG t-ax)e '*e(u, z)du+ (29) 


0 


t—%x 
| K?(u, t —2)2e #* e(u, x) du, 


0 


K? (a, th=A,(a+t — ig) eg ihhs 77) 4 


t—x 
2 [ K?(u,t-2)dr,(uje °#* du + (30) 
0 


t—x 
i K'(u,t-a)A,(uje *#* du. 
0 


Explicit solution of these equations is possible only when we as- 
sume that \’s are not age dependent. As assumed in the previous 
section, let A,(z) =A, and A, (z) =A, where A, and A, are con- 
stants. Then e(u, x) no longer depends on u, and hence we write 
e(u, 2) as e(a). We observe that K'(a, t)e'/e(x) and 
K? (a, the#“***/), are functions only of t- 2. We therefore make 
the substitutions 


K' (a, the*? = e(2)Q, (¢-2), (31) 
K? (a, t)e#O+™) = 0, (¢ — 2). (32) 


Substitution of (31) and (32) into (29) and(30) yields integral equa- 
tions for 0, and , which are easily solved. 


1 


NGO UO ie pad 


x A, take het ead eso aie aes 
2 


Hence K' and K? are given by 


ke wv, ¢). = Bese e bt » aX (Ayt 2227 uw) (t— x) ae 
(x, ¢) v2. oe [(A, + 2A,)e ul) (34) 
= Vaan ot 2x) aleus 

K* (a, t) = —“______[(A, + 2A 1+ 2A. MXt-x) _ 
(2 4) \okhas on a an H e 


If we put A, = 0, (5.26) gives the solution for K the auxiliary func- 
tion introduced by Kendall. Observe that K'(x; t) 40 even if we 
put A, = 0. 


A on ey ce te ae Wake 
- * 
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We are now in a position to solve the integral equations (24), 
(26), (27), and (28). In those equations, the arguments wu, y - 2, 
t — zx of the integrand are not ordered, while 2 < y <¢ on the left- 
hand side. We can do so by using the definition of the product 
densities and the range of integration. We can illustrate this by 
considering 


t-—x 
| f02D(m y-—2, t- 2) du. 
0 


The range of integration can be split up into two parts (0, y - a) 
and (y — 2, t — x), and we can write 


t—x 5 Aaa 
| f2°-Y(u, y — 2, t- 2) du =| f8*O(u, y- a, t-2)du+ 
fe) 0 


(36) 
IP PTEIG ~2z, u, t- 2x) du. 
yx 
From (26) and (27) it follows that 
S&P? (a, y, Ase MHF = fy? (@, y, He(2)- (37) 
Similarly we obtain from (24) and (28) 
FED (a, y, the PMA, = fg" (@ Ys Hele). (37°) 


Thus though we have a system of integral equations involving four 
functions it is now effectively a system involving two functions 
only. We shall take up equations (27) and (28). Substituting (37) 
and (38) into (27) and (28) and using (36), the equations reduce to 


pee?) (ar; y, t) =r, K*(y ~2,t—2) +20, fi(y-2,¢-2) + 


ee 
(A, + 222) | tt CP Boal a) edu + (38) 
0 


t—x 
Ao / Ley =a, u,t—2)+ 2 AS (y -— a, u, t- a)| du, 
y—-x 


poe er, Ys t) =\,K'(y-2, t — 2) 3 tee {yo =e, t — 2) + 


9 (1 — e7H*) ££ (@, y, t)e*h* + 


¥—-Xx% 
u ¢ (2,1) Ey oe, 
i (Ader 8 Kasra (u, y - 2, t x) du + (39) 
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t—x 
A ely ~ 2) e2MU-DEFAD (y — 2, u, t- 2) + 
a 


2fL)(y-2, y,t-a}du. 
The form of the equations (38) and (39) suggests the substitutions 
e8Hx _9(1 ~e7#*) e 4H* £533) (o, y, t) = f(y — 2, t — 9) 5 
e *B* FLA) (e, y, t= d(y—2,t-y). (41) 
Substituting (40) and (41) into (38) and (39), we obtain 
b(u, v) =A, K?(u, ut+v) + 2dr, Ff? (u, wt v) + 


(Aqct 223) [ed —1, v)drt+ (42) 
0 
agente FAC v-7n)+2d(7,0- r)tdr, 
F 0 


w(u, v) =A, K*(u, w+) + AS f{P(u, w+ v) + 


(A, + 2A,)d(u—z, v)dr+ 


(43) 
ro oveue fu(r, v—1)+2d(1, v-—Dhdr. 
) 
Defining g(u, v) as 
g (u, v) = 674 iy (u, v) + 2¢d(u, v)}, (44) 
we obtain from (42) and (48) 
g(u, v”) =A, e7F"[K!(u, w+ v)+2K?(u, ut) + 
f{P(u, wt v) + 4fO(u, w+ v) + (45) 


(A, + 2A,) [ g(r, oar g(t, 0-67? ae 
0 


0 
Thus 


dg (u, v) = 
a Ay HB Ag) AQ meMBrPC FANT) 4 (A, + 2A.) 9(u, 2), (46) 


Y 


—Te ea ke 
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and so 
yee ae ee 


g(u, v) =F, (v) ert 2A2)e _ 
NG + 2X. —p 


(47) 


Introducing (47) into (46) and solving the resulting integral equa- 
tion for F, (v), we find 


F (2) = 
(Ae 2A, —p) 
(48) 
Hence g(u, v) is given by 
ro e Art 2Aa7 w)Y 
u, ¥) = ———————_ 12(A, + 2A,) E 
g(u, v) array {2(A, JQ 3s) 
eat 2hQ)(utvy—pe = nr, + 4d,)¢ Ot aha al (49) 


— pr, +2d,)e4"$. 
Equation (49) together with (42) yields an integral equation for ¢. 
This is solved in exactly the same manner as (45); 


2 eat 227 wu % 
—————————EE A, ad p) es* Pha bL)v ape 


2 

Q, + 2A, — p) (50) 
2dr? u 

A, + 2A, —B 


o(u, v) a 


ErecsaA, yet ths He} — oat? ham Mv Buu 


Equations (49) and (50) constitute the solution of the problem. 
The various product densities are given by 


e(aye H™ A 
ten t= Vee 
c gly -2, (51) 


2e h* dy — a, t-y)l, 


tS (e, Ys t) me 


€ Tid ane 
fe? (e, Ys t) = Oo sy-2,t-u); (52) 
2 
{P(x y, =e H*[g(y-2, t- ye RO? — : es 
2d(y-2,t-ye **), 
fi?” (a, y, t)= e {BX d(y—a@, t- @) « : (54) 
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With the help of equations (51) to (54), we can obtain any kind of 
information regarding fluctuation. 
Putting A, = 0, we note that 


fOD = f@D = f(D = 0 (55) 


and 


2 
pe (es y, t) = “ oe 2uttry(t-y) for, eh (t—x) _ 


pe mya tBCRHy) _ 7 gHGH =} 


which can be identified with equation (114) of Kendall, if we 
recognize that the cumulant function y(z, y, t) and f{"(z, y, 2) 
are connected by the relation 


fntas y, t)= hi (z, t) f, (y, t) + y(a, y, ¢). (57) 


The second moments and the mean square deviation are obtained 
by using (9). The mean square of the total number of twins is 
given by: 


2r2 iph 
E {N(z)} = ere e *#¥(a, +a, +4,) + 
> (eCArt2ra-p)t _ e~2ut) (58) 
2 
Ay + 2A, +p 
where 
t 
a, = Q, - 7) pe eS PR e (Mut 2r,t sure Se 
A, +2A,+ 3p 
MIN 
1+ gtou 
<a 20a, + 8A,) oe eat” ae (ert 2rat aye _ ne 
: A, +2A,-u [A, +20, +34 
(59) 
NEVES Turani chin tines »| : 
1 SAg hp 
2 1 
a, = Cee E (e7HF — 1) 2, 
A, +2rA, +4 | Qu 


1 


ei eS (eArt2rgts mye os »| ’ 
A, + 2A, 4+ 3p 
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REMARK ON SOME CORRESPONDENCES BETWEEN THE 
CHARACTERISTICS OF THE NERVE IMPULSE AND 
THOSE OF FERROELECTRIC POLARIZATION 
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Ferroelectric polarization currents have characteristics which may be 
of use in the study of the nerve impulse either directly or indirectly 
through the suggestion of parallel experiments in the two domains. For 
example, the ‘‘all or none’’ aspect of the excitation of the nerve impulse 
can be considered to correspond in some sense to the coercive force re- 

_ quired for the reversal of a ferroelectric polarization. The refractory 
period then corresponds to the lack of a relaxation process in ferroelec- 
tric polarization; for a second application of voltage in a given direction 
does. not produce a ferroelectric current; it is necessary that the polari- 

_ zation be reversed by a reversal of voltage. The ‘fheat block’? in nerve 
conduction may correspond to a ferroelectric Curie point at about 40° C. 
Velocities of propagation of ferroelectric polarization are of the order of 
magnitude of the velocities of sound and therefore in a range suitable for 
the interpretation of the observed velocities of nerve impulses. These 

_and other parallels between ferroelectric behavior and the characteristics 

of the nerve impulse suggest that there may be a useful degree of simi- 
larity between the molecular processes responsible for the nerve impulse 

_ and those responsible for ferroelectric polarization. 


_ The purpose of this paper is to point out some correspondences 
' between the characteristics of the nerve impulse and those of 
ferroelectric polarization. These correspondences may simply 
- mean that the two processes resemble each other in some aspects 
of their overt behavior, though the basic molecular (ionic, elec- 
f tronic) processes leading to this behavior may be quite different in 
the two cases. On the other hand, it is possible that the resem- 
ee in overt behavior reflect significant similarities in the 
molecular processes underlying ferroelectric behavior and the 
é nerve impulse, respectively. Further study is required to determine 
which of these is the actual situation, but in the meantime it may 
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be of interest to compare the two processes by citing a few ex- 
amples of the correspondences between them. The comparison is 
made in terms of a tentative, qualitative model which is not de- 
scribed in detail but merely indicated. 

The ‘‘all or nothing’’ aspect of the excitation of the nerve im- 
pulse is certainly one of its most characteristic features: to this 
corresponds the coercive force of ferroelectric polarization. [While 
dielectric polarization forms to a degree proportional to the ap- 
plied voltage however small, ferroelectric polarization does not 
form (appreciably) until the applied force equals the coercive 
force. This behavior strongly distinguishes ferroelectric polari- 
zation from dielectric polarization and from ionic conduction. ] 

The refractory period of the nerve is referred in the present 
model to another characteristic feature of ferroelectric behavior, 
namely, the fact that a ferroelectric can not be polarized further in 
a given direction until a counter-voltage has been applied to re- 
verse its direction. (A dielectric polarization would relax in the 
course of time, but a ferroelectric polarization has no relaxation 
process.) On this model a reversal, or re-setting, of the ferroelec- 
tric polarization must occur before another impulse can be con- 
ducted after one has passed. The reversal of the ferroelectric 
polarization is referred to the relaxation of a dielectric polariza- 
tion which forms simultaneously with the ferroelectric polarization. 
(While ferroelectric polarization is treated as the central process 
in the model, dielectric polarization and ionic conduction are intro- 
duced as secondary processes which combine with ferroelectric 
polarization to give the impulse its observed characteristics.) A 
calculation of the velocity of propagation of the impulse in this 
type of model leads to a result in order-of-magnitude agreement 
with the observed velocities of propagation of the nerve impulse. 
The velocity-determining factor is taken to be the rate of propaga- 
tion of the ferroelectric polarization; this is observed in ferro- 
electrics to be of the order of magnitude of the velocity of sound 
in the material. 

Conduction of the impulse with attenuation, after a section of 
nerve has been narcoticized, is referred on this model to an in- 
crease in the coercive force produced by adsorption of the nar- 
cotic, A ‘‘heat block’’ has been observed in the propagation of 
the nerve impulse. On the present model this would be due to the 
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presence of a Curie point in the ferroelectric at about 40° Centi- 
grade. 

The above is presented as a sampling of the correspondences 
between the nerve impulse and ferroelectric polarization proc- 
esses. Further study is required in order to determine whether the 
correspondence is complete or only partial. If it proves to be par- 
tial, it seems to the writer that it is still of interest that the two 
processes have a considerable amount in common. 
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Morphologically identical nerve endings subserve more than one sen- 
sory modality. Two possible explanations are: such endings are func- 
tionally different, responding to one type of physical excitation only, or 
such endings are capable of responding to more than one type of physical 
stimulus, but central mechanisms discriminate between different stimuli. 
A simple network is described to illustrate how thermal receptors could 
discriminate between thermal and mechanical stimuli. 


Many sensory receptors respond to more than one type of physical 
excitation. For example, the infrared receptor of the pit viper re- 
sponds to mechanical excitation (Bullock and Diecke, 1956); the 
ampulla of Lorenzini responds to thermal changes and also has a 
mechanoreceptive function (Murray, 1957); some mammalian mecha- 
noreceptors respond to thermal changes (Hensel and Zotterman, 
1951). Morphologically identical free nerve endings in hairy skin 
(Lele, Weddell and Williams, 1954) and in the cornea (Lele and 
Weddell, 1956) subserve the modalities of touch, cold, warmth, and 
pain. Two possible explanations can be given for the origin of the 
different modes of sensibility in such cases: (1) Morphologically 
identical free nerve endings are actually divided into functionally 
different classes each of which responds to one and only one type 
of physical excitation. (2) The same free nerve terminals in the 


*This research was supported in part by the United States Air Force 
Office of Scientific Research of the Air Research and Development Com- 
mand under contract No. AF 18(600)-1454. Reproduction in whole or in 
part is permitted for any purpose of the United States Government. 
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skin which subserve warmth and cold sensibility can, if stimulated 
in the appropriate way, also give rise to the sensation of touch, 
prick, itch, and sharp pain (Lele, Weddell and Williams, 1954). We 
shall consider only this latter explanation, discussing it in terms 
of both the discrete and the continuous theories of neuron inter- 
action. 

A simple network of McCulloch-Pitts neurons can be constructed 
to illustrate how peripheral mechanisms could discriminate between 
thermal and mechanical stimuli. Figure 1 shows such a network. 
The open circles represent inhibition and the closed circles repre- 
sent excitation. All second-order neurons are assumed to have a 
threshold of two. Each warm and cold receptor, indicated by W 
and C respectively, contributes an afferent branch to a third neu- 
ron P. Since thermal changes evoke opposite responses in the ac- 
tivity of the two types of receptors, e.g., a rise in the skin tem- 
perature increases the spontaneous activity of warm receptors and 
inhibits the activity of cold receptors, any thermal change at the 
skin surface would evoke a change in the output in W, or C, but 
not in P. If the peripheral terminals of W and C are also able to 
to respond to mechanical or destructive stimuli, simultaneous ex- 
citation of W and C would inhibit activity in W, and C, but the out- 
put in P would be proportional to the sum of the activities in W 
and in C. 

These results can be formulated mathematically as follows. The 
average frequency of response of W in Figure 1, v,,, increases with 


FIGURE 1 
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temperature T over a physiological range. Over a somewhat smaller 
range we can assume that this relation is linear. Similarly over 
some range of pressure values, v may be assumed to be linear 
with the pressure P. Hence if T,, Be; and y, are constants, then 
for T > T, , 


me eal A), yk. (1) 


The quantity 7, is the value of the temperature at which the re- 
sponse frequence of WW extrapolates to zero. 

We would expect that there would be a similar relationship be- 
tween the frequency v, of the neuron C and the temperature and 
pressure except that the frequency increases with decreasing 
temperature. Hence if T,, 285 and y, are constants, we may write 


v= PB (T,-T)+y,P- (2) 


The quantity 7, is the value of the temperature at which the re- 
sponse frequency of C extrapolates to zero. 

If we suppose that there are higher levels W,, W,,..-; and C,, 
Cy, +++, Of interacting elements in two neuron chains starting re- 
spectively at W, and C, in Figure 1, as considered previously 
(Williams and Landahl, 1958, Figure 1), then the output frequency 
of C chain approaches vy, -—v,,ifv, >. 

Let us suppose that the neurons W and C are similar in the 
sense that ie = B, and y, = y,,- Introducing 


Ty = $(Ty -T,)s (3) 

we then have from (1) and (2), 
v,—v,=2B(T-Ty), T>Ty- (4) 
Thus the output of the W chain is linear with temperature but inde- 


pendent of the pressure. When vy, > v,,, the output frequency for the 
W chain approaches zero while that of the C chain approaches 


Toone” 


y,-v,=28(T,-71), T>Ty- (5) 


Thus, 7, is a temperature at which the output in both chains ap- 
proaches zero, this temperature corresponding to a neutral tempera- 
ture. The output of the C chain, given by (5), is also independent 


of P. 
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On the other hand, the frequency v, of neuron P will be given by 
the following value, unless the frequencies are too high (Landahl, 
McCulloch, and Pitts, 1943), 


vi=v, +, =2BTy, +2y,P. (6) 


This value of v. is independent of 7, depending only on the pres- 
sure P, It should be noted that, in the case of pressure, no inter- 
acting chain is needed. But whereas continuous background ac- 
tivity in C and W is largely eliminated by the chains, C,, C,, 
Cy,---, and W,, W,, W,,..., which interact, this continuous ac- 
tivity adds to the output of P. We have assumed that B, = B, and 
Ye = Y,+ If these relations do not hold the results are modified. 
This possibility will be discussed subsequently. 

If the warm-cold interaction is considered in terms of the macro- 
scopic continuous model a number of interesting features can be 
accounted for. Consider the simplest case of symmetric inhibition 
in Figure 2. Let the excitatory and inhibitory factors produced by 
a receptor be denoted by e and j with o =« —j representing a net 
excitation. Using the notation of Householder and Landahl (1945) 
we have 

de dj : 

Gib o T=dRS-A, (7) 
assuming that ¢ and wy are proportional so that J = Rd. Now for 
warm receptors referred to by subscript w, we may write 


ba = As i tye (8) 


Saas 
®/ Warmth Center 


, 
/ 


a6 


Touch Center 


Cold Center 


FIGURE 2 


No 
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the quantities B,, and y, being the same as Ch and y,, of equation 
(1) except for units. Similarly we can write the following expres- 
sion for the cold receptors: 


To simplify matters let the coefficients a, 6, B and y be the 
same for both kinds of receptors, i.e., 8, = B,, = 8, and let thresh- 
olds A,, h,, and A, of the neural elements of Figure 2 be negli- 
gible. Consider only temperatures between 7, and T7,,. In the 
steady state we obtain the following expressions from (7), (8), and 
(9): 

eee eee) A(T —T,) «(1 — kh) yh, — (10) 


o,, =fe. ~j-)o = - FR) 8 (Ty -T)+GA-F)yP. (11) 


The net excitation at the center for warmth, o, which is o,,, - 


co 


Jeoo: 18 then given by the following expression which is the nega- 


tive of the net excitation o,,, at the center for cold: 


ee gg PAT —T,) (12) 


woo 
T, being given by (3). If T > Ty, a, is positive while o,,, is 
negative and hence produces no response. On the other hand, if 
T < T, the reverse is true. Note that the net excitation depends 
on temperature but not on pressure. 
The net excitation o,,, at the touch center in the steady state is 
given by 


Cpa = 7, -—o,,, = B(1-R)(Ty - TL) +201 -R)yP. (13) 


The net excitation depends on pressure but not on the temperature T. 

Now suppose that the temperature is suddenly changed from T to 
T + AT, where AT is not necessarily small. After integration of 
(7) and substituting (8) and (9), we find the following expressions 
for the excitations at the warm and touch centers when T > T,: 


Gg Oats ge ABAT (= eSh= Ba Res"), ae 


Bp (0) = Fae am 


Hence, while there is no change in the excitation at the touch cen- 
ter there is a transient and a permanent change in the activity of 
the warm center. If we start at a temperature T < Ty, we can ob- 
tain a result similar to (14). In Figure 3 typical results are sche- 
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FIGURE 3. Curve I, 7 > Ty and AT > 0; Curve I, 7 > Ty and AT <0; 
Curve Ill, 7 < Ty and AT > 0; Curve IV, 7 < Ty and AT <0. 


matically illustrated. Note that if T is fairly large compared with 
Ty (neutral temperature) and if the sudden decrease in temperature 
is not too large there may be a marked reduction in the sensation 
of warmth which is transient (Curve II, Figure 3), while if the 
change is large enough there may be a cold response even though 
the temperature remains above T,,. If initially the temperature is 
sufficiently colder than 7,, and a large enough increase is intro- 
duced, then these may be a warmth response even though the tem- 
perature remains below 7, (Curve III, Figure 3). 
If instead of changing 7, we change P to P + AP, we then find 


0, (t) = F059 (16) 
Gp (t) ~ op, t+2yAP(1-e *-R+Re *), (17) 


There is therefore no change at the warmth center but there is a 
transient and a permanent change in the touch center due to the ap- 
plication of the added pressure AP. We see therefore that the 
above model provides an adequate representation of both tempera- 
ture and pressure changes. 

If, in addition to the relatively rapid inhibitory factor j, there is 
another slower accommodative factor which may be a neural or a 
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circulatory effect, then the value of T, could change slowly and 
thus take on various values as does actually occur. 

In the above calculations we have assumed that Vw = Yo wits 
natural to suppose that both warm and cold receptors are nearly 
alike with respect to the effect of pressure but it would not be ex- 
pected that y, and y,, are exactly equal. If not, then the response 
to warmth would have a small dependence on pressure which would 
be positive if y, > y,, but negative if y, <y,. Unless the effect 
were fairly large it would not be noticed without looking for it. 
Such an effect would be referred to as an illusion, In a similar 
way, if the sensitivity to warmth were somewhat greater than that 
to cold, the response to pressure would be increased by warmth. 
In addition to these effects, there are the effects which arise from 
the fact that the actual relations are non-linear. 
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A mathematical framework for a rigorous theory of general systems is 
constructed, using the notions of the theory of Categories and Functors 
introduced by Eilenberg and MacLane (1945, Trans. Am. Math. Soc., 58, 
231-94), A short discussion of the basic ideas is given, and their possi- 
ble application to the theory of biological systems is discussed. On the 
basis of these considerations, a number of results are proved, including 
the possibility of selecting a unique representative (a ‘‘canonical form’’) 
from a family of mathematical objects, all of which represent the same 
system. As an example, the representation of the neural net and the 
finite automaton is constructed in terms of our general theory. 


I. Introduction. In a previous paper (Rosen, 1958), we attempted 
to investigate some of the aspects of a general theory of biologi- 
cal systems and to point out several of the possible applications 
which such a theory might have. We remarked at that time that 
although our treatment was an intuitively reasonable one, it was 
not yet of sufficient scope to provide the foundation for a general 
theory. It is the purpose of the present paper to remove the am- 
biguities contained in our earlier approach and to show how a pre- 
cise mathematical theory of systems may be constructed. 

Before we undertake the construction of our general theory, it 
may be of value to point out some of the difficulties which arise 
in an attempt to formalize our earlier treatment. We recall that an 
arbitrary system might be decomposed into a collection of smaller 
objects called components and that these components could be 
arranged in an oriented graph (the block diagram) as follows: two 
components M,, M, are to be connected by an oriented edge 
M, — M, if an output of M, is an input to M,. However, such an 
oriented graph is unable to provide an adequate representation for 
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many situations and may hence lead to a distorted picture of the 
actual input-output relations obtaining between the components of 
a system. For example, the number of distinct outputs produced 
by a component M, is not necessarily the same as the number of 
oriented edges for which M, is origin in the block diagram, since 
more than one component may receive the same input from M,. 
This will be the case, e.g., if M; were an endocrine gland that 
produces a hormone which affects several different organs. On the 
other hand, a component M, might provide a component M, with 
more than one distinct output, although the block diagram will only 
indicate one oriented edge connecting M,; and M,. To illustrate 
this, we may once again take M, to be an endocrine gland, such as 
the pituitary, which provides a number of different hormones to the 
same organ. 

Further, it is necessary for formal purposes to adjoin to the 
block diagram a formal vertex which represents the environment. 
This vertex must be connected as origin to all components which 
receive environmental inputs and as terminus to all components 
which produce environmental outputs. Since this vertex is not 
itself a component, its introduction necessitates a troublesome and 
artificial attention to special cases in many arguments. 

Even more important than the foregoing is the necessity of hav- 
ing an adequate means for discussing the behavior of the various 
time lags which occur in the operation of a biological system. We 
remarked in our earlier paper (loc. cit.) that the operation lags of 
the various components of a system will depend in general upon 
the particular inputs with which the component is supplied. There 
is no obvious and easy means of accomplishing such a representa- 
tion within the framework of a pure graph theory. Likewise, the 
requirement that a component producing several distinct outputs 
will in general possess a different lag at each output is slightly 
troublesome, 

Although we may to a certain extent overcome the difficulties 
we have mentioned by the introduction of a number of technical de- 
vices, the theory which results will have lost the intuitional clar- 
ity which constituted a large part of its appeal. It is therefore 
evident that a suitable representation theory of biological systems 
will require an entirely different point of view than the one adopted 
above and will utilize a new set of mathematical tools. The repre- 
sentation theory which we present below will be seen to fulfill 
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these conditions. The mathematical tools which we find appropri- 
ate are contained in the comparatively recent theory of categories 
and functors (Eilenberg and MacLane, 1945). 

Since this theory is largely unfamiliar to non-mathematicians and 
since the emphasis which is placed on the theory in the mathemati- 
cal literature is quite different from the one which we will require, 
we enter into a brief discussion of its basic notions before we con- 
struct our representation. It will be seen that, although the theory 
which results seems at the outset to be considerably more compli- 
cated than our previous treatment, we can formulate our results, 
and even our definitions, in a simpler, more intelligible and more 
precise fashion than is possible through any refinement of our 
other approach. 


Il. The Formal Representation. Throughout our previous dis- 
cussion, we have regarded a component of a system M as a “‘black 
box,’’ capable of receiving a number of inputs and of emitting a 
number of outputs. We observe that this behavior implies that each 
component acts selectively upon its environment at each input, 
that is, each input to a component must belong to a fixed set of 
admissible inputs in order to be accepted by the component. Like- 
wise, each output of a component is an element of a fixed set of 
admissible outputs. Boti of these sets are completely determined 
by the component itself, Thus, for example, an ordinary audio- 
amplifier will accept as input only the elements of a certain set of 
electric currents and will produce as output another element in 
this set. Likewise, only the elements of a definite set of chemi- 
cal compounds can serve as inputs to a given enzyme; this set is 
termed the set of substrates of the enzyme; the outputs of the en- 
zyme will similarly belong to fixed sets of chemical compounds. 

With these ideas in mind, let us attempt to find a suitable repre- 
sentation for the simplest type of component. We consider a com- 
ponent M which receives a number m of inputs and emits a single 
output. In a block diagram, M would be represented as a vertex 
which serves as the terminus for m directed edges pyy +++» Pm and 
as origin for a single arrow p. According to our discussion above, 
to each arrow p, in the block diagram there corresponds a definite 
set, the elements of which are capable of serving as inputs to M. 
Let us designate the set corresponding to p; by A, for each ¢ such 
that 1<2z<m, and the set corresponding to p by B. Then we can 
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regard the effect of M as a mapping or transformation, which as- 
signs to every m=tuple (a@,, «++, @,), %;€A;, a definite object 
beB. In formal terminology, we assign to the component M a map- 
ping f, where 


[UA XAG nw ee 


In words, M is to be represented by a mapping f, the domain of 
which is the ‘‘cartesian product”? of the sets of admissible inputs 
to M, and the range of which is the class of admissible outputs 
of M. 

In the general case, where we allow the component M to emit 
n > 1 outputs, we must in general assign a mapping /; to each out- 
put p, of M. Thus, if B, is the set of admissible output objects 
produced by M at p;, we write 


fe Ay We cee te — Bye 


Hence a general component is to be represented by an n-tuple of 
mappings, where n is the number of outputs of the component. 
These mappings all possess a common domain; namely, the set 
A, xA,x...xA,, but their ranges will differ, in general. 

~ These simple remarks provide the main conceptual basis for the 
ensuing discussion; the remainder of the paper is largely an elabo- 
ration and formalization of these remarks. We may observe that 
regarding a biological system as a set of mappings incorporates 
most of our intuitive notions about these systems in an extremely 
natural way. Further, we see that most of the difficulties men- 
tioned in the Introduction have disappeared. Thus, if a component 
M provides the same input to more than one component, then this 
merely means that some of the mappings which represent M are the 
same. Likewise, it is seen that no difficulty can be caused by a 
component M providing a second component with more than one 
distinct input, 

With this background in mind, let us proceed to introduce the 
mathematical tools which will provide the formal basis for our 
representation, We shall proceed axiomatically. The above dis- 
cussion, together with some elementary examples which we intro- 
duce to fix the ideas, should serve to motivate our presentation. 

The first notion which we introduce is that of a category. Ex- 
plicitly, a category consists of the following data: 


1, A collection of objects, which we shall designate by A, 
A’, eeoe 
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2. A function assigning to each pair (A, A’) of objects in the 
category, a set denoted by H(A, A’), the elements of which 
are called mappings or transformations. This set may be 
empty for some pairs (A, A’). 

If fe H(A, A’), then we shall call the object A the domain of f; the 
object A’ will be called the range of f. 

The data above represent the irreducible minimum which must be 
given in order to construct a theory of a set of mappings; namely, 
we must be given both the mathematical objects on which the map- 
pings act and the mappings themselves. We must now introduce a 
means of combining, or composing, the mappings which we are 
given. Hence, we further require of a category: 

8. There exists a function (called composition) which assigns 

to pairs (f, g) of mappings such that feH(A, A’), ge H(A’, A’), 
a mapping (denoted by gf) in the set H(A, A’’). 
The composition operation may perhaps be more clearly repre- 
sented by means of a diagram of mappings of the type shown below: 


t ‘ 
A—————— > 4 


We shall have occasion to make further use of diagrams of this 
type throughout the ensuing discussion. A diagram such as the 
above, in which any two directed paths of mappings with the same 
origin and terminus yield the same resultant, is called a commuta- 
tive diagram. 

As an example of a category, we may consider the following: 
let the objects of the category consist of all groups, with the map- 
pings of each set H(A, A’) being the group homomorphisms of A 
onto A’, Or, we may take for the objects of the category the total- 
ity of all topological spaces, with the mappings of each H(A, A’) 
taken to be continuous mappings. For many other examples of 
such structures, we refer the reader to the paper of Eilenberg and 


MacLane (1945). | 
It will be seen from these concrete examples that the data 1-3 


above must be subjected to certain axioms in order that our ab- 
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stract categories may enjoy properties suggested by these ex- 
amples. Thus, we shall require the following: 

Cat.al: "H(A, A’) OHA hae ae 
whenever A # A, and A’ # A,’. 

Here % is the empty set. This axiom merely states that any map- 
ping has exactly one domain and exactly one range. 

Cat.2: {cH (A, A’), 9g eH(4t, AD, and teh AZ 2. 
then A(gf) = (hg) f. 

This axiom states that the composition of mappings satisfies the 
associative law. 

Cat. 3: For each object A in the category, there exists a map- 
ping 7, ¢€H(A, A), such that if A’ is any other object 
and feH(A, A’), then fi, =f; for any mapping 
g«H(A’, A) we havei,g =9. 

The mapping 7, is called the identity map of the object A. We may 
remark at this point that since there exists by (Cat. 3) a one-to-one 
correspondence between the objects of a category and their iden- 
tity maps and since by (Cat. 1) each mapping of the category 
uniquely determines its domain and range, we could have dis- 
pensed with the objects entirely and defined the category solely in 
terms of its mappings. We shall find that this is the generaliza- 
tion of a remark made earlier with reference to biological systems, 

Let us now denote by A an arbitrary category. We define a sud- 
category A, of A to be a collection of objects and mappings of A 
satisfying the following conditions: 

Sub. 1: If AeA), then 7,4 «A. 

Sub. 2: If f and g are mappings in A, such that the composi- 
tion gf is defined, then gf«A,. 

Sub. 3: If f is a mapping in A), then the domain of f and the 
range of f are objects in A). 

It is readily verified that the objects and mappings of a subcate- 
gory themselves comprise a category. 

We now introduce the notion of functor, In general, whenever a 
mathematical structure has been defined axiomatically, we desire 
a means of comparing, in some sense, different objects which bear 
the structure in question. Most frequently, we proceed by con- 
structing various structure-preserving mappings between such ob- 
jects. For example, groups may be compared with one another by 
means of homomorphisms, and topological spaces by means of con- 
tinuous mappings, Categories, then, may be compared by means of 
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some generalized type of mapping; such a mapping is called a 
functor. We now proceed to give a precise definition. 

Let A and B be categories. A covariant functorT: A > B is a 
pair of mappings (the two denoted by 7, the first of which assigns 
to each object AeA an object, which we shall write as 7 (A), in B; 
and the second of which assigns, to each mapping fe H(A, A’) in A, 
a mapping which will be written as T(f)eB, T(f)eH(T(A), T(A’)1. 
We shall require the following conditions to be satisfied: 

Fun. 1: T (gf) = T (g) T (f) whenever gf is defined. 

Fun. 2: T(i,)=t7 ra) for each AeA, 
We remark that it is possible to define contravariant functors, 
which behave precisely as do covariant functors, except for the 
fact that they reverse the directions of the mappings. Throughout 
the following discussion we shall deal exclusively with covariant 
functors which we shall refer to simply as functors. 

A functor T is called fazthful if the following conditions are 
fulfilled: 


Fid. 1: If f and g belong to H(A, A’) and T (f) = T(g), then 
Fid. 2: If feH(A, A’) and geH(A’, A) are mappings such that 
gf =i,; andif T(A) = T(A’), T(f) =27,4); then A = A’, 


Mappings f/f, g which satisfy the conditions mentioned in the postu- 
late (Fid. 2) are called equivalences of the objects A, A’; in the 
case where A is a category of groups, its equivalences are iso- 
morphisms; in a category of topological spaces, the equivalences 
are homeomorphisms. 

It is easily verified that the image of a category A under a func- 
tor T: A —> B is a subcategory of B. If T is faithful, then this 
image, which we may denote symbolically as T(A), is abstractly 
indistinguishable from A, and we may in this fashion regard A as 
being embedded by T as a subcategory of B. 

With this background, we may now state a theorem which will be 
of great importance to our theory of representation of systems: 

Theorem 1. Any abstract category A can be embedded as a sub- 
category of the category S, the objects of which consist of all sets 
and the mappings of which are the totality of all set-theoretical 
many-one mappings of sets. 

Proof. The proof of this theorem is given in the paper of Eilen- 
berg and MacLane (1945) and consists in the explicit construction 
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of a faithful functor from A to S. We refer the reader to the origi- 
nal paper for the details. 

As a consequence of Theorem 1, we may by choosing a definite 
faithful functor T: A —» S, regard the objects of an arbitrary cate- 
gory A as ordinary sets and the mappings of A as ordinary set- 
theoretical mappings. Hence in this manner the usual set-theoreti- 
cal terminology will make sense in an arbitrary category. In par- 
ticular, we may speak of unions, intersections, inclusions, and 
cartesian products of objects in a category. We shall have more 
to say about this aspect in Section IV below; we merely observe 
here that in general an arbitrary category will not be closed under 
these set-theoretical operations. For our purposes, however, we 
shall require from the outset that the categories with which we 
deal shall be closed under the formation of cartesian products. 

This completes our digression on the mathematical tools which 
we shall utilize in the representation of biological systems; we 
now turn to the construction of the representation itself. Our point 
of departure is the observation made earlier that, given an arbitrary 
category A, we can form oriented graphs (called diagrams on p. 317) 
by selecting a collection or objects {A;3 in the category and a col- 
lection of mappings from the appropriate sets H(A,, A,;). Two ob- 
jects in the diagram will be connected by an oriented edge if and 
only if some mapping of the collection has one of the objects as 
range and the other as domain. For our purposes, however, we 
shall require a slightly more general type of diagram in which the 
objects of the diagram merely contain the domains and ranges of 
the mappings. 

Let us agree to denote by d(f) and r(f) the domain and range, 
respectively, of an arbitrary mapping in a category. Then we shall 
say that a collection of objects {A} and a collection {ft of map- 
pings in a category comprise an abstract block diagram if the fol- 
lowing conditions are satisfied: 


A. B.D. 1: If f is a mapping in the collection, then there exist 
objects A, A’ in the collection such that d(f) c A, 
r(f) ¢ A’. 

A.B.D, 2: If A is an object of the collection and A is of the 


form A, x A, x...xA,,, then A, is in the collec- 
tion, for l<i<m, 
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APD af A =A, x A. x... x A, and there exist mappings 
f; in the collection such that r(f;) C A, for each ¢, 


then for any mapping g in the collection such that 
d(g) C A, we have 


d(g) n [r(f,) x Ab rg bohrete aa € Sa) ese 


These abstract block diagrams can serve as a vehicle for the 
representation of biological systems. To clarify the above defini- 
tion, let us once again recall the situation which we are attempting 
to formalize. We desire that each object A in the abstract diagram 
shall represent a definite class of physical materials, capable of 
serving as inputs or outputs to a component of some system, to 
which the diagram may correspond. (Since not every such diagram 
may correspond to a physical system we use the adjective ab- 
stract.) Each component of the system is itself represented by a 
collection of mappings in the diagram, all of which possess a 
common domain. Since it is in general possible for a component 
to accept a wider class of inputs than may be provided by other 
components of the system, we have allowed the inclusions in our 
definition, instead of requiring strict equality. 

The three axioms we have imposed on an abstract block diagram 
will be seen to have simple interpretations in this context. The 
first axiom expresses in concise terms the statement that every 
component of a system shall have at least one input and at least 
one output, The second axiom is essentially a kind of closure 
condition and states explicitly that if a component receives an 
input, then that input must be represented within the system. The 
third axiom ensures that the system will actually be capable of 
stable operation, in the sense that the mappings corresponding to 
an arbitrary component will actually be defined on at least some of 
the possible inputs to the component. 

Before we proceed to consider a specific example, let us pause 
for a moment to comment upon some of the aspects of the represen- 
tation theory outlined above. First, we notice that the great utility 
of Theorem 1 lies in enabling us to consider all the classes 4, 
which we may encounter as being sets in the mathematical sense, 
and thereby enjoying all the properties and operations of sets, 
while allowing us to sidestep the dangerously metaphysical ques- 
tion of whether, and in what sense, these classes ‘‘really’’ are 
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sets. Second, we observe that, although our representation takes 
the form of an oriented graph, just as did the cruder representation 
which we described in the Introduction, we find a peculiar inver- 
sion of vertices and oriented edges between our abstract diagrams 
and the block diagrams with which we started, That is, a vertex 
in our original block diagram was thought of as representing a com- 
ponent of the system, whereas in our abstract diagrams the compo- 
nents are represented by mappings, i.e., by oriented edges. Like- 
wise, the oriented edges of the original block diagram were to 
represent the inputs and outputs of the system, whereas in our ab- 
stract diagrams we find that these inputs and outputs are repre- 
sented by objects in a category and hence are vertices in the dia- 
gram. 

To illustrate the inversion between edges and vertices men- 
tioned above, as well as to bring out clearly the manner in which 
our representation theory operates, let us consider as a particular 
example the representation of the system M, the hypothetical reso- 
lution of which into components (block diagram) is shown in Fig- 
ure 1, 

According to our representation theory, there is a set of map- 
pings corresponding to each component M;. Each of the compo- 
nents M, except for M,, emit a single output; hence each of these 
will be represented by a single mapping which we shall designate 
by f; The component M,, however, produces two outputs and 
hence is to be represented by a pair of mappings (one for each out- 
put), which we shall write as f,“” and f,@ (Figure 2). 

Next, we must consider the representation of the links p; shown 
in Figure 1, Let us suppose that p; is an output of a component 
M, and an input to a component M,. We recognize two possible 
cases, according to whether p; is the only input to M, or not. If 
there are no other inputs to M,, then we may represent the link P; 
by the set A,= r(f) vu Lu d(f, ls where 7; and ete are mappings 


corresponding to the components M,, M, respectively. If M, re- 
ceives other inputs besides fj, then ‘for each we find that df, ) 
is a cartesian product, such that r(f;) is contained in one of the 
sets which constitute the factors of “this product. We may then, 
for each a, project d(f,.) onto the set containing r(f,)s in the 
sense of Cartesian Products (see e.g., Chevalley, 1956, p. 15); let 
us denote this projection by qd; (fs Finally, we define the set 


—_— ore”, 
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A 6, R 
Tf fp 


FIGURES 1 and 2. The component M, in Figure 1 corresponds to the 
mappings /,‘ and f, in Figure 2. The component M, corresponds to /,; 
M, -— to f;; etc. The input 9, in Figure 1 corresponds to the ‘Sobject”’ A, 
in Figure 2. The inputs », and ?. into the component M, correspond each 
to the objects A, and A., respectively. The total input into component 
M, in Figure 1 thus corresponds to the Cartesian product A, x A, in Figure 2. 


A, which represents the link p; in the analogous manner as we did 
in the simple case; namely, 


Aeriy [yatta 


The reader can verify for himself without much trouble that this 
choice of mappings and objects constitutes an abstract block dia- 
gram, as we have defined it above. 

All these considerations lead from the block diagram of Figure 1 
to the network of mappings shown in Figure 2. We observe that 
this representation has removed the difficulties mentioned above 
concerning the representation of the environment E, We have taken 
FE into account in a natural fashion among the 4,. Also, the in- 
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version of vertices and edges between the diagrams of Figure 1 
and Figure 2 is clearly shown; we remark that care must be taken 
to avoid confusion which can easily arise due to this inversion. 

Since the same lines of reasoning which led from Figure 1 to 
Figure 2 may be applied to any biological system, we obtain our 
Representation Theorem. 

Theorem 2. Given any system M and a resolution of M into com- 
ponents, it is possible to find an abstract block diagram which 
represents M and which consists of a collection of suitable ob- 
jects and mappings from the category S of all sets. 

Proof. Completely contained in the above discussion. 


Ill. Canonical Forms. Our construction of an abstract block 
diagram to represent a given system, which we outlined in the pre- 
vious section, was contingent upon a particular decomposition of 
the system into components. Since it is in general possible to de- 
compose a given system in more than one manner, it therefore fol- 
lows that we may correspondingly find a large number of abstract 
block diagrams which represent the system. In this manner, we 
may introduce an equivalence relation into the collection of ab- 
stract block diagrams which we may form from the objects and 
mappings of a given category, calling two abstract block diagrams 
equivalent if they are both representatives of the same system. It 
is the purpose of the present section to investigate some methods 
by which we may select a definite representative from each such 
equivalence class of abstract block diagrams; this is called a 
problem of determining canonical forms for block diagrams. 

We shall first introduce some customary set-theoretic terminol- 
ogy which we shall find useful in formulating our results. If we 
are given a collection § of sets and a 1-1 onto mapping between S 
and the elements of some set /, then we shall say that the mapping 
indexes the collection S by the elements of / (the set / is then 
called an indew set for S). We shall write an indexed collection 
as (A,),,,. In what follows, we shall always restrict ourselves to 
finite collections. If the collection contains n sets, then we shall 
take the index set / to consist of the first n positive integers. 
This being so,:we shall assume that the idea of the cartesian 
product of a family of sets (A,);-, is known. We shall write the 
cartesian product of an indexed collection of sets as ITA; 

ie! 
If J is an index set, we shall call a 1-1 mapping of / onto itself 


es Ae 
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as a permutation of I, and we shall denote such a permutation by 
n{I). There is a nautral equivalence (1-1 onto mapping) between 


the cartesian products JJA, and J] A, for any permutation 7; 
ie] ié€m(1) 

this equivalence merely expresses the general commutativity law 

for cartesian products. A subproduct of a cartesian product ITA, 

i€] 

is obtained by writing ITA, where J C J, Let us suppose that /] 
yes 

contains n elements and J contains m <n elements. We shall de- 


fine the projection of JJA, on ITA, in the following manner: 
ie] jes 


Choose a permutation of 7 such that the elements of the set J be- 
come precisely the first m elements of 7(/). Then map any ele- 


ment ze ITA, onto its first m elements; this will be a well-deter- 
i€] 
mined element in ITA, 
jel 
This procedure of projecting a cartesian product upon a sub- 

product may be expressed by means of the following diagram: 

i er A; 

i€l ieTT) 


S | 
TT A; 
jel 
Here P represents the desired projection, 6 is the natural equiva- 
lence between the product and the permuted product, ‘and P is the 
mapping described above which sends each element of the per- 
muted product onto its first m elements. It is clear that P = P-9@, 
in the sense of composition of mappings; i.e.,:the diagram is 
commutative. 
Finally, if A,.B, C are objects in a given category A, ‘and 
f: 4— B is a mapping in the category, : then 7 is said to be factor- 
able through C if there exists mappings ¢ : A> C,%~:C— Bin 


A such that the diagram 


g 
A————-_- C 


is commutative. 
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With these preliminaries, we may now undertake the main por- 
tion of our discussion. To motivate our procedure, let us suppose 
that we are given a block diagram of a system M; let M, be an ar- 
bitrary component of M with m inputs and n outputs. According to 
our representation theory, M, will be represented by a family of n 
mappings, which we designate as before by Teo? Oa Loe aie 
The domain of each of these mappings f, ots contained in the car- 
tesian product JJA,, where / is the set of the first m integers. 

i€] 
We now proceed to investigate in detail the possible behavior of 
the mappings f,. 
In particular, it is important for our purposes to investigate the 


question as to whether it is possible to factor a mapping 

fe IT4,~ B through a subproduct of ITA; i.e., whether it is 
i€ i€ 

possible to find a set J C / such that the diagram 


Pp 
TT AG ————— TAS 
f fk 
ke bas 
B 


is commutative, where P is a projection, ‘and Fi is prime with re- 
spect to all subproducts of ITA,- We remark that such a factori- 
yéJ 
zation, if it exists, is unique, as can readily be proved. Physi- 
cally, the possibility of factorization of a mapping f,.. means of 
course that the ath output of M, does not itself require all the 
stated inputs to M, in order to be produced, but only a certain 
subset of them; i.e., certain of the A, are superfluous in the 
domain of ir 
If we Seo these factorizations on each mapping tes in the 
abstract block diagram representing M, we obtain a new collection 
of mappings hao? each defined on a subset of a certain subproduct 
of ITA; It may of course happen that factorization is not possi- 
i€ 
ble for some a, in which case we have Pig = at Fe We can now 
collect these mappings into classes, : putting two. mappings into 
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the same class if and only if they have the same subproduct of 
ITA, as domain. Physically, it will be seen that the possibility 
i€ 

of factorization implies that our original choice of components was 
too coarse. In other words, the component M, may be more accu- 
rately thought of as consisting of a number of components, each of 
which is specified by a set of factored mappings is all of which 
have as common domain a subset of the inputs to M,, ‘and none of 
which are ‘‘superfluous.’’ 

We may carry out this procedure on every component in the block 
diagram of the system M. It is clear that the factored mappings 
thus obtained, together with the subproducts which constitute 
their domains and ranges, provide us with a new abstract block 
diagram which is equivalent to the one with which we started, in 
the sense defined above. Further, it is obtained by a sequenceof 
canonical operations (i.e., factorizations through subproducts) and 
hence is in this sense unique. Hence, we have a Canonical Form 
Theorem. 

Theorem 3. Given a block diagram for an arbitrary system M, 
we can find an abstract block diagram representing M such that 
none of the mappings of the abstract diagram is factorable through 
any subproduct of its domain. 

Proof. See the above discussion. 

This particular type of canonical form for abstract block dia- 
grams seems to be the most natural one to consider. However, 
there are other types of canonical decompositions which suggest 
themselves, of which we mention one in particular, because of its 
relation to the study of general automata. Its most convenient 
formulation is the following: 

Theorem 4. Given a decomposition of an arbitrary system M into 
components, we may find a further decomposition of M such that 
every component of the new decomposition emits exactly one out- 

ut. 
: Proof. Let M, be a component in the given decomposition of M. 
If M, already emits exactly one output, :there is nothing to be 
done. Hence, let us suppose that M, emits m>1 outputs. In the 
abstract block diagram representing M, the component M, 1s tepre- 
sented by a family of m mappings icp Os 1, 1.25 Mm Ns now re- 
construct a block diagram by allowing each of the mappings fi, © 


correspond to a new component which we shall call My and which 
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is provided with the same inputs as M,. This process is repeated 
for every component in the block diagram. The details of the 
necessary connections required to turn this set of mappings into 
an abstract block diagram equivalent to the one with which we 
started are slightly tedious, but elementary and will be omitted. 


Let us now briefly turn our attention to the problem of the possi- 
ble behavior of a system under the termination of an input to a 
component M,. This discussion will bring to light certain aspects 
of our representation theory and at the same time will lead to a 
new formulation of Theorem 3. 


To this end, let M, be a component which receives m inputs and 
emits n outputs. Then M, is to be represented, as usual, by the 
mappings ipo a=1,..., 2 According to Theorem 3, each of 
these mappings will have a common domain, which we may write 
as JJA,, where /] may be taken as the set of the first m integers. 

ie! 
Now the suppression of an input to M, means precisely, in our 
terminology, the replacement of one of the sets A, (A; say) by 
the empty set. This is in turn equivalent to saying that the do- 
main space of the mappings Pe has been reduced to a subproduct 
ITA, where J is the set / with the index 7, deleted; i.e., J =7/- 
jéJ 
lig}. But since the mappings ih may by virtue of Theorem 3 be 
assumed to be prime to subproducts of JJA,, they are in general 

ie] 

not defined on any such subproduct. This may perhaps be seen 
more clearly from a consideration of the analogous fact that a 
function of n variables will assume a definite value only if a value 
is given to each of the variables. Even in mathematical terms, a 
cartesian product, one factor of which is the empty set, is itself 
empty. 

llowever, we know that there exist many systems which contain 
components that will produce some kind of output, even when 
one or more of their inputs are terminated. We represent this state 
of affairs as follows: We consider that a mapping i » Which repre- 
sents a component M,, may be of such a nature that a restriction 
rule to subproducts for this mapping is given along with its defini- 
tion. Thus, the mapping may still be defined on suitable sub- 
products ITA, of JJA, Such a mapping will be called contracti- 


yéJ ie] 


ble to the subproduct ITA; We shall call the elements of the 
7éJ 
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range of a contracted mapping defective outputs, in line with 
their physical significance. If a mapping is such that no restric- 
tion rule to subproducts is given, the mapping will be called non- 
contractible. It will immediately be seen that this definition of 
contractibility is precisely the translation of the definition we pro- 
posed elsewhere (Rosen, 1958) into the terminology of our repre- 
sentation theory. 

We may now restate Theorem 3 in a manner which more clearly 
shows its relationship to the concepts we have introduced above: 

Theorem 5. Given a block diagram for an arbitrary system M, 
we can find an abstract block diagram representing M, ‘such that if 
any set A is removed from the diagram, then every mapping f such 
that A is a factor of d(f) produces defective outputs. 

Proof. A restatement of the proof of Theorem 3, using the termi- 
nology introduced above. We leave the details to the reader. 

The notion of contractibility is the device used in our repre- 
sentation theory to discuss the deeper intrinsic properties of bio- 
logical systems. The simplest possible case, of course, is that 
in which all mappings of the diagram are non-contractible; this 
case was discussed at some length in our previous paper (Rosen, 
1958). In the more general case, we notice that one of the impor- 
tant consequences of contractibility is that the range of a mapping 
will in general be altered by contraction, while the domains of 
other mappings, for which this domain may serve as factor, are left 
unaltered. There is no guarantee that these domains and ranges 
will ‘‘match up’’; i.e., have a non-empty intersection. Hence a de- 
fective input may serve as no input at all to certain components. 
It will be seen intuitively that the notion of the poisoning of com- 
ponents by means of certain inputs is closely related to this type 
of situation. Also closely related to the above is the general 
problem of determining the dependent sets of the components of an 
arbitrary system, in the sense defined by us (Rosen, 1958) . 
These problems seem very difficult. 

As a final word in this context, we may remark that it is possi- 
ble to define a concept of the expandability of mappings in an ar- 
bitrary abstract block diagram, ‘corresponding to the contractibility 
we have discussed above. The justification for this notion is the 
fact that biological systems, when placed in a richer environment, 
may take on new properties (i.e., emit new outputs). The notion of 
expandability may perhaps be a useful means for attacking this 


problem. 
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IV. A Theorem on Functors. It was mentioned at the time of 
their definition that functors furnish a natural means whereby cate- 
gories can be compared, much as groups can he compared by homo- 
morphisms and topological spaces by continuous mappings. Simi- 
larly, it follows that any structures formed from the objects and 
mappings in various categories may likewise be compared by func- 
tors.’ Thus, given an abstract block diagram (which we may denote 
by M) of objects and mappings in a category A, we may apply a 
functor 7: A —> B and obtain in the category B the collection of 
images under T of the objects and mappings of M. We may write 
the image of the abstract block diagram M as T(M). One of the 
natural questions to ask concerning a given functor 7 is whether 
the image of an abstract block diagram under T is again an ab- 
stract block diagram. The present section is devoted to a discus- 
sion of this question. 

First, we observe that Theorem 1 implies that, in the study of 
any functor T: A — B, we may as well assume that ACS, BCS, 
where S is the category of sets. Now S has certain structures of 
an algebraic nature imposed upon it, and these structures were 
utilized in the construction of the abstract block diagram. For in- 
stance, S has a local Boolean structure, imposed by the operations 
of intersection and union. Further, the objects of S form a par- 
tially ordered space under the operation of set inclusion, and a 
commutative semigroup under the cartesian product operation. 
Now an arbitrary functor T: A — B need have no relation to these 
structures, even if T is faithful; just as a 1-1 set-theoretic mapping 
between two groups need have no relation to the group structures. 
If we wish these structures to be preserved, then we must impose 
the necessary restrictions upon the functors which we shall con- 
sider, 

Let us consider at the outset functors T which map the category 
of sets into itself; we shall denote this situation by 7: S S, 
We shall say that such a functor is regular if it satisfies the fol- 
lowing conditions: 


Reg. 1: If AeS and A ¥Q, then T(A) #9. 
Reg. 2: If ACB, then T(A) c T(B), 


The first condition is a non-triviality type restriction. The second 
condition requires that T preserve the partial ordering placed on 
the objects of S by the inclusion operation. 
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We shall now prove a lemma, which will be required in the proof 
of the main theorem of this section: 

Lemma. If T: S +S is a regular functor, and AB #9, then 
T (A) 0 T(B) #9. 

Proof. The hypothesis A 4 B #4 $ means explicitly that there ex- 
ists a set C #§ such that CC A andC CB. By (Reg. 1), T(C) #9. 
By (Reg. 2), we have T(C) Cc T(A) and T(C)c T(B). Thus by 
definition we have that T (A) 1 T(B) #9. 

Finally, we agree to call a functor T: S + S multiplicative if 
for any two sets A,, A,, we have that T(A, x A,)=T(A,)x 
T(A,). Thus, a multiplicative functor preserves the cartesian 
product operation of S$ and may in fact be regarded as a semigroup 
homomorphism on S, 

We are now ready to state the main result: 

Theorem 6. Let M be an abstract block diagram which repre- 
sents a definite biological system. Let T be a faithful functor. 
Then 7 (M) is an abstract block diagram which represents the sys- 
tem if and only if T is regular and multiplicative. 

Proof. First, let us assume that T satisfies the conditions of 
the hypothesis. We must then show that T(M) satisfies the three 
conditions (A.B.D. 1-3) which we have laid down for an abstract 
block diagram. 

(A.B.D. 1) is satisfied by any functor 7. 

To verify that (A.B.D. 2) is satisfied, let f: JJ4,; > B be a 

Te 
mapping in M. On applying the functor T, this becomes Tf (f): 
T([JA,) — T(B). The multiplicativity of T, however, implies 
iél] 


that T(JJA,) = JJ{T(A)]. But by (A.B.D. 2) we know that, for 
ae! ie! 


each index i, A,eM. Hence each 7 (A),) is in T (M), and therefore 
(A.B.D. 2) holds in T (M). 

To say that (A.B.D. 3) holds in T (M) is precisely to say that ve 
is regular, as a glance at the definitions and the lemma proved 
above will verify. 

Thus, we have shown that 7'(M) is an abstract block diagram. 
That it must represent the same system as M follows from the 
faithful character of 7. Thus, half our theorem is proved. It re- 
mains to show that if 7 (M) represents the same system as M, then 
T is faithful, regular, and multiplicative on the objects and map- 


pings of M. 


336 ROBERT ROSEN 


First, we show that 7 is faithful. The faithfulness of T on the 
mappings of M is a consequence of the fact that two components in 
the system represented by the same mappings in M must be repre 
sented by the same mappings in T(M). The faithfulness of T 
on objects follows from the fact that, if a component of a sys- 
tem provides the same output to two distinct components, then 
this output must be represented by equivalent objects in M and 
in T (M). 

To prove multiplicativity of T, we observe that, if | JJA, +B 

ie] 


is a mapping in M, then the domain of T(f) is T(JJ4,). Bué since 
ie] 


T (M) represents the same system as M, the input to the component 
represented by T (f) must be precisely JJ[T7(A,)]. Comparing these 
ie! 


two, we see that the multiplicativity follows. 

The regularity of T follows from the same type of argument 
which proved the multiplicativity. 

Hence the entire theorem is proved. 

Corollary. If T: S + S$ is a functor such that, for any abstract 
block diagram M the objects and mappings of which are in S$, T (M) 
is again an abstract block diagram equivalent to M, then T is 
faithful, regular, and multiplicative on all of M, and conversely. 

Proof. Immediate from Theorem 6. 

Theorem 6 is a first step in the very difficult problem which 
plagues attempts at representation of biological systems; namely, 
to prove that equivalent biological systems have the same repre- 
sentation and, conversely, to show that two mathematically equiva- 
lent representations actually correspond to the same system. We 
are still far from achieving this result, since among other things 
we have only considered the block diagram aspect of the structure 
of systems and have neglected time lags and other aspects of 
structure, Nevertheless, Theorem 6 tells us at least how we may 


begin to construct equivalent representations of biological sys- 
tems. 


V. Example. The McCulloch—Pitts-von Neumann Theory of 
General Automata. An automaton consists, in the first approxima- 
tion, of a black box and a finite number of inputs and outputs, each 
of which satisfies the special property that it is capable of assum- 
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ing precisely two observable states. Following the terminology of 
von Neumann (1945) we shall call these states the ‘stimulated and 
unstimulated states, An automaton is considered to be completely 
specified when it is known which of its outputs assume the stimu- 
lated state for every stimulation of a subset of its inputs. 

On the basis of his general theory, von Neumann constructs 
every general automaton from a network of appropriate single-out- 
put automata (modulo the time lags of these automata, which will 
be discussed later), Hence it will be sufficient to restrict our at- 
tention to this particularly simple type of automaton. Let us de- 
note by U a single-output automaton which receives n inputs. Each 
of these inputs, by definition, is capable of assuming precisely 
two states; let these states be denoted by the symbols 0 and 1 
which shall correspond to the unstimulated and the stimulated 
state respectively. In the terminology of our representation we 
may take, for the set A which represents the set of admissible ob- 
jects at each input of U, the set consisting of exactly two ele- 
ments 0, 1. This is the familiar ‘‘coin-tossing space,’’ and we 
write A = {0, 1}. 

Our representation further stipulates that the single output auto- 
maton U is to be represented by a single mapping f, the domain of 
which is the cartesian product of the sets of admissible objects 
corresponding to the inputs of U. Thus, the domain of the mapping 
f may be taken as the cartesian product of A with itself n times. 
This product may be expressed simply as the set of all sequences 
of 0 and 1 of length n; or equivalently, as the set of all binary 
digits of length n. 

Since the output of the automaton U is also capable of only two 
states, the range of the mapping / may be taken to be just the set 


A itself. Hence, if we introduce the notation A” = I A, (where 
=1 
A, =A for each index 7), then we have 
f; A” > A. 


Explicitly, f(z) = 1, eA” means that the total input results in a 

stimulation of the output of U; f(z) = 0 means the opposite. 
According to our general definitions, a mapping f may be fac- 

tored down to a subproduct A”, m <n, if and only if a mapping f 
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and a projection P: A” —> A™ can be found such that the diagram 


a 
4! 


is commutative. In the special case under consideration, we may 
embed A™ in A” in a variety of ways as follows: We have a subset 
J of the first n integers, the elements of which we shall denote by 
19 Igy e005 %,,¢ We can write this subcontract as 


Aly = (...0,4; 5 0, Leutis A;.s arena A; Sap t) . 


where of course A, =A, =...=A; =A. For example, if we 
1 t2 tm 


consider the subproduct pe obtained from A” by omitting the i*® 
co-ordinate of A”, we find that this can be expressed as 


AX; mA Alpe. Reno A). 


i-1? i+1? eee 


On this basis, it is now easy to verify explicitly that a neces- 
sary and sufficient commision for a mapping f to be factorable 
through the subproduct A“; 1 is that for every input 2¢A”, we have 


A ve cMenses A cane 4) = 104, ee. Pere: | A A,). 


FeL IES 


The situation is analogous for all other subproducts. We shall as- 
sume throughout the following discussion, as usual, that f is not 
factorable, 

We now turn our attention to the contractibility of the mapping f; 
that is, the behavior of the automaton U upon the termination of 
one or more of its inputs. We have already, in our discussion of 
factorization, implicitly used the fact that an empty input is equiv- 
alent to an input of 0. This means that we can treat this special 
case as if we did not really drop down to a lower dimensional sub- 
product upon terminating an input to U; and hence the mapping f 
will always be contractible, and in fact, to every subproduct of 
its domain. We can write down explicitly the definition of the con- 
tracted mappings, as follows: Let At) be a subproduct of A”, 
corresponding to an index set J = {i 19 tariveny t od Cl. Then we 
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define the contraction f, of the mapping f to Athy as follows: 


f,(z) = Lif and only if re[f-'(1) 0 eral 
f; () = 0 if and only if we[f~*(0) n AGI, 
where, by the natural embedding discussed above, x can be re- 
garded as an element of both AZ, and A", thereby giving sense to 
the definition of fy 

Let us, finally, suppose that thew is the subproduct obtained by 
terminating the i‘ input to the automaton U. Then according to 
the definition of the contraction of f to a it follows that the 


automation will be completely inhibited by this termination [i.e., 
f@ (A%5') = 0] if and only if the condition f~'(1)  A%5'=9 is 
satisfied and will be completely unaffected by this termination if 
and only if f~'(1) 9 Bese = f~'!(1). This last condition is equiva- 
lent to f-'(1) C A%;’. We further note that our assumption of the 
non-contractibility of f forces this last inclusion to be sérict, for 
if it were the case that f~!(1) = A"~1, then it would immediately 
follow that f would be factorable, through the one-dimensional sub- 
product (0,...0, A;, 0,...0). The generalization of this discus- 
sion of the behavior of a single-output automaton U to the termina- 
tion of one of its inputs to arbitrary subproducts (i.e., to the termi- 
nation of more than one input) is straightforward and can be 
omitted. 

We now turn to the problem of the representation of the general 
automaton. As mentioned earlier, each general automaton can be 
considered as a network of single-output automata, Our represen- 
tation theory shows that each single-output automaton can be rep- 
resented as a mapping the range of which is the ‘‘coin-tossing 
space’? A, and the domain of which is a cartesian product of A 
with itself, taken a number of times equal to the number of inputs 
to the automaton. Hence it follows that a category A sufficient 
for the representation of arbitrary automata is obtained by regard- 
ing the set of all finite cartesian products of A with itself as the 
set of objects in the category. The only mappings of the category 
will be the sets H(A”, A) which will be the totality of set-theo- 
retical mappings between A” and A. It is readily verified that A is 
indeed a category and is obviously closed under the cartesian 


product operation on the sets of the category. 
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It thus follows from the above discussion that a general auto- 
maton can always be represented as an abstract block diagram in 
the category A defined above. Conversely, it follows from Theorem 
4 that an abstract block diagram of objects and mappings in the 
category indeed represents a general automaton. Thus, we obtain 
a complete characterization (ignoring time lags, for the moment), 
of all possible general automata, It may be of interest to observe 
at this point that this last remark and Theorem 6 imply that any 
functor R: A — A, such that the image under F of an abstract 
block diagram which represents a general automaton is again a 
general automaton, is necessarily the identity functor on the ob- 
jects of A, for the image of any single-output automaton, repre- 
sented by the mapping f: A® — A, is transformed by F into 
R(f): R(A") — R(A). In order for this image to be a single-output 
automaton, we must have r[R(f)] =R(A) =A; by Theorem 6 we 
have R (A”) = [R(A)]”, which by our last observation is just A”. 

The brief discussion presented above is sufficient to indicate 
how the graphical aspects of the theory of general automata follow 
in a very simple manner from the formalism which we have devel- 
oped in the preceding sections. The general automaton is in fact — 
one of the simplest possible illustrations of our formalism, and 
indeed the entire theory of automata might have been derived, in 
abstracto, from this viewpoint. The application of category theory 
to more general kinds of systems becomes correspondingly more 
complicated, but at the very least, we hope to have indicated in 
the foregoing that the notion of systems introduced here can be put 
on a rigorous basis and that the results obtained by using those 
notions can be formally justified, 


The author is indebted to Professor N. Rashevsky for a thorough 
discussion of the manuscript. 
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A photochemical system is considered which is similar to that of 
S. Hecht but which includes an intermediate substance between visual 
purple and visual white. This intermediate is assumed to change the 
membrane voltage of the photoreceptors. A simple model is presented 
from which one may calculate the frequency of the impulses resulting 
from the application of light. It is found that this physicochemical model 
enables one to account for a considerable number of physiological 
phenomena of vision. Among these are the light and dark adaptation 
phenomena. By introducing a simple assumption regarding spatial inter- 
action one can account for some psychological phenomena such as the 
effect of intensity, light-dark ratio, shape, and area on the critical 
flicker frequency. A number of other phenomena are discussed in terms 
of the model. 


It is the purpose of this paper to consider the general problem of 
vision and to give some of the principal results of a new model of 
the neuro-element of the retina. In a subsequent paper it is 
planned to discuss in detail the fine structure of the retina and 
compare the experimental results with the theoretical expectations. 
These results will be given in connection with the study to deter- 
mine the minimum threshold quanta necessary to produce visual 
sensation, which is now in progress in the Department of Ophthal- 
mology at Nara Medical College, Japan. 
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The most difficult problem in this research has been to develop 
one theory which would explain both the physiological and the 
psychological phenomena of visual perception. Previously these 
two phenomena have been considered as separate matters. Another 
problem has been to formulate this one theory of visual perception 
into a definite form, since hitherto there has been no precise defi- 
nitions concerning the mechanisms of vision. 


I. General Conception of Visual Perception. In order to under- 
stand in detail the mechanisms of visual perception it is necessary 
to consider several very complicated processes. In general we may 
divide them into the following categories: 

1) The photochemical process occuring in each photoreceptor. 

2) The electro-neuro-physiological process of the propagation 

transmission and summation of the nerve impulses. 

3) The psychological perception of the visual stimulus. 

The phenomena belonging to each of these three categories 
usually are discussed separately as if they were completely in- 
dependent matters. For example, if we wish to explain visual per- 
ception from the photochemical point of view, we usually are not 
concerned much with the physiological aspects of the retina or 
with the psychological phenomena. However, all visual perception 
is produced as a whole from these three processes, so that if we 
wish to suggest any kind of model which will truly explain the 
visual process, we should consider some natural way to connect 
the three categories with each other. Otherwise it is impossible 
to discuss the process of visual perception, the most important 
problem at present in the field of the physiology of vision. 

It is well known that the reaction time to the light stimulus, the 
summation of the light stimulus, and the dark adaptation process 
are the most important characteristics included in the visual 
process. These three characteristics are usually discussed sepa- 
rately, but obviously they are correlated with each other at all 
times. Roughly speaking, in a bright light the reaction time is 
short but it lengthens as the light decreases. Therefore, the 
temporal summation effect will increase in a darker place. In addi- 
tion to this, in the dark, the spatial summation effect will increase 
simultaneously with the increasing temporal summation effect, due 
to the light stimulus. In other words, in bright illumination the 
discrimination is great, decreasing as the light intensity de- 
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creases. On the other hand, the light stimulus will exhibit more 
spatial summation. Consequently, the retina will take advantage 
of the sensitivity to the weak stimulus at the sacrifice of the dis- 
crimination power of the light stimulus. In order that this concept 
be easily accepted by others, we would first like to give the fol- 
lowing simple examples. 

1) If we observe a neon sign during the daytime, we will notice 
that the neon light flickers intermittently. However, when we see 
the same neon sign at night, it shines as a continuous light. 
Actually this fact shows that in daylight the discriminatory power 
of the eye to the temporal change of the light stimulus is greater 
than in the dark. In other words, this fact will completely agree 
with the results observed first by Schaternikov (1902) in which the 
fusion frequencies of the photopic eye increased more than those 
ofthe scotopic eye. 

2) Suppose we turn off the light at night and permit the moon- 
light to enter the room through the window. For the first few mo- 
ments we are unable to see anything at all, due to the darkness. 
However, we are able to read the numbers written on the luminous 
dial of a wrist watch. As time passes, we are gradually able to 
distinguish the furniture in the room and we feel that the surround- 
ings have become brighter. But if we now look at a watch, we are 
unable to read the numbers on the watch dial as easily as before, 
in spite of the fact that we feel our eyesight has improved due to 
dark adaptation. On the basis of this example, we conclude that, 
as the eye adapts to the darkness, it gains in photosensitivity due 
to the retina’s increase in summation power for the given light 
stimulus, with a resulting loss in detail vision. 

3) When we see lightning, it appears as if it were a streak of 
light extending from the sky to the earth. But actually this light- 
ning is caused by an electric charge oscillation of a duration of 
less than one thousandth of a second. On the basis of this, it 
must be concluded that the peripheral retina and the central retina 
are stimulated simultaneously. The famous Japanese physicist 
Torahiko Terada considered this as an example of temporal illusion 
rather than spatial illusion. We can easily observe the same fact 
experimentally. Arrange several electric bulbs in a line. The 
center bulb is at the fixation point of the line of sight. When we 
turn on the lights simultaneously, it appears as if the center bulb 
lights first and the other bulbs light successively toward the pe- 


346 SADA YOSHI KAMIYA 


riphery so that the lights appear to run from the center to both ends 
simultaneously. The results of this experiment can be understood 
as a natural physiological phenomenon resulting from differences of 
the reaction time in different parts of the retina. Since the light 
appears to run from center to periphery, the reaction time at the 
periphery is longer than at the center. In other words, if we con- 
sider that the response of the eye to the light stimulus completely 
depends on the amount of light energy absorbed by the photo- 
chemical substances contained in each receptor then the true 
cause of this temporal illusion would be the difference in the 
amounts of photosensitive substance in photoreceptors at different 
parts of the retina. Thus if &, denotes the distance from the 
central fovea to each photoreceptor, the amount of the photo- 
sensitive substance in each photoreceptor AS. will be repre- 
sented by a function of F: ; 


AS,;=¢ iR,}. (1) 


4) As is well known there are many discussions concerning 
spatial illusions which are considered to be psychological prob- 
lems. In connection with this we would like to present some ex- 
amples which were given by the Japanese psychologist Z. Yokose 
(1957) in analyzing the visual process from the standpoint of W. 
Kohler’s ‘‘Physischen Gestalten.’’ 

If we project an after-image on a contour-line, the disappearing 
process of the image will differ according to the structure of the 
contour-line. For example, the disappearing process of the after- 
image projected on the straight line in Figure 1a will be different 
from that which results when the line forms part of a square, as in 
Figure 1b. 


a 
PAWN 
QYy 


(b) 


FIGURE 1. The process of the disappearance of the after image (aft 
Z. Yokose, 1957). Bact 
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If instead of the after-image, we project a small dim spot of 
light which is visible near the contour line, the threshold intensity 
of this spot must be increased as the spot approaches nearer to 
the line. 

Now let us explain Yokose’s theory in a slightly modified form: 
Any line segment such as (ab) shown in Figure 2 will to some 
extent influence a point P which is not on the line. The total 


(a) (b) 

a a 

is 

ress Sy Ay 

See 

ee ae oe 

ee 

potes 2 

id 

Pa stcd 2 ar 

Pi 
b b 


FIGURE 2- Potential field of the visual space (after Z. Yokose, 1957). 


influence coming from the contour line of brightness H to a point 
P corresponds to a potential p which is defined as follows: 


CoH 
p=k { oe 6dé, (2) 
-6 


where D is the distance from any point Q on the contour line to the 
point P, K is a constant, and @ is the angle between the normal to 
the line from P and the line PQ. 

The experimental results of Yokose agree with the type of formu- 
lation indicated by equation (2). However, note that (2) is a 
generalization of what Yokose wished to say. For the time being 
we should like only to introduce into our discussion the following 
conditions. If an excitation takes place on any part of the retina, 
it will influence to some extent the surrounding area, and the in- 
fluence might decrease almost in proportion to the distance from 
the excited point to the other point. Furthermore if many points 
are stimulated, simultaneously, the effects will be superimposed 
on each other.”’ 

5) As a final example, we will cite a very important observa- 
tion, which was first made by R. Granit (1930), concerning the 
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interaction of distant areas on the human retina. He first deter- 
mined the fusion frequency of four circular test patches, separated 
by an unstimulated area at the central retina and the peripheral 
retina. He then compared the fusion frequencies of each one of 
the patches in turn with the fusion frequency previously obtained. 
The results obtained were as follows: (1) There was a difference 
between the fusion frequency of the single test patch and the 
fusion frequency for the four test patches, and (2) the difference 
was very slight in the central fovea but large in the periphery. 
These results show that the interaction of each stimulus increases 
as the parts stimulated are more distant from the central fovea, the 
distances between each of the parts being kept constant. The 
facts illustrated by examples 4 and 5 above indicate that the 
cause of the so-called spatial illusion is due to the interaction of 
the photoreceptors on the retina. The magnitude of the interactive 
effect depends on the distance between two points. 

Let us summarize briefly the facts obtained from the preceding 
five examples. It has been shown in examples 1 and 2 that the 
temporal summation and the spatial summation of light stimulus 
might change depending on the different adaptation states. In 
examples 3 and 4 it has been shown that the temporal illusion and 
spatial illusion discussed in psychological terms might possibly 
be explained in physiological terms. In example 5 it has been 
shown that the summation effect of the retina will differ depending 
upon the part of the retina stimulated. 

In conclusion, we can say that all matters mentioned above are 
composed of three different processes: 

1) All visual sensation depends on the light and dark adaptation 
states, and the light and dark adaptation states depend on the 
amount of photosensitive substance contained in each photo- 
receptor. 

2) Excitation of each photoreceptor is determined as the result 
of the light energy absorbed by the photosensitive substance. 

3) The excitation which takes place in each photoreceptor will 
spread out over the random neural network of the retina. 

In addition to these three concepts, the following two facts 
should be taken under consideration: (1) there are differences in 
the amounts of photosensitive substance included in each photo- 
receptor between different parts of the retina, and (2) the excita- 
bility of an unexcited receptor is modified by the activity of neigh- 
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boring excited receptors. Then the various phenomena which are 
expressed in psychological terms might possibly be explained in 
physiological terms. However, there are difficulties in accepting 
these concepts, since, in the older concepts of visual mechanisms, 
the functional activity of the photopic eye and the scotopic eye 
are considered as separate entities. In order that our concept be 
accepted, it is necessary to discuss briefly the old concepts of 
the visual process. 

It is well known that the mosaic theory has taken its model from 
the anatomical structure of the compound eye of insects, because 
the compound eye has some basic resemblances to the mechanisms 
of the vertebrate eye. In order to understand the mosaic theory of 
vision, we first must understand the adapting mechanisms of the 
compound eye for different intensities of light. The element of 
the compound eye is called the ommatidium. Each ommatidium is 
connected to its own nerve fiber. Roughly speaking, the peripheral 
endings of all the nerve fibers in this type of eye must form a 
light sensitive surface similar to the mosaic of the photoreceptors 
(rods and cones) of the human eye. The sensitive element of each 
ommatidium receives light coming only from that part of the out- 
side world which exactly faces its facet. The result of this is 
that an erect image of the outside object is formed on the mosaic 
of the photosensitive nervous element. 

The adaptation of the ommatidium to light and dark is brought 
about by the migration of black pigment to different positions ac- 
cording to the state of adaptation of the eye. The eye in the light 
adaptation state is called an appositive eye. In this case the 
ommatidia are separated from one another by the black pigment. 
The light entering an ommatidium in the direction of its axis can 
reach the nervous photosensitive element; however, another light 
ray coming from the source striking the black pigment between 
ommatidia is lost by absorption without inducing any excitation. 
Thus, in the appositive eye a given point of the retina receives 
light only from a corresponding point of the object of vision. This 
results in increased acuity of vision. An erect image of the ex- 
ternal object is formed in compound eyes at the level of light 
receptors. The eye adapting to the dark is called a suppositive 
eye. In this case, as a result of the migration of black pigment, 
the layer of light receptors is placed at a distance from the layer 
of conical elements and is separated from them by a transparent 
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interval containing no pigment. Thus the light receptor of a given 
ommatidium can receive, in addition to the light falling along its 
own axis, rays of light entering structures of these ommatidia. 
Consequently, the eye achieves more sensitivity for weak light 
and loses acuity of the visual pattern. In an insect eye the change 
from the appositive eye to the suppositive eye shows the same 
characteristics as the change from photopic to scotopic and also 
results in an increase in sensitivity to light accompanied by a 
loss in acuity of vision. This fact is of interest in considering 
the development of animals or in comparative animal physiology. 

In spite of the functional similarity, there are important anatomi- 
cal differences between the insect eye and the vertebrate eye. 
The vertebrate eye is an inverted retina in place of the mechanism 
of the compound eye, and in the human retina there is no evidence 
of migration of the pigment in the different adaptation states. The 
reason for the human eye having the same acuity as the compound 
eye must be due to the nature of the retina or to the neuromechanisms 
of the network of the human retina. In this connection, the ob- 
servations which have been recorded by S. R. Detwieler (1943) 
concerning the thickness of the retina of twenty-seven kinds of 
animals will be helpful in considering the activity of the neural 
network of the human retina. Detwieler has shown relationships 
between the behavior of the animals and the thickness of the 
retinas of each animal. He stated that the external nuclear layer 
of all nocturnal animals is relatively much thicker than the in- 
ternal layer; whereas in the diurnal lizards, and particularly in 
birds, the internal nuclear layer is much thicker. This is especially 
apparent in the sparrow and the goldfinch. The great thickness of 
the internal layer in birds is in all probability an anatomical ex- 
pression of their great visual acuity. In the nocturnal form, the 
reverse is true, with many rods collected together into fewer bi- 
polar and association cells, so that the external nuclear layer is 
much broader than the inner nuclear layer. From these findings we 
can consider as a natural fact that the adaptation process depends 
upon the anatomical structure of the neural network of the retina 
and is especially dependent on the activity of the rod system in 
integrating the impulses coming from the photoreceptors of the 
area stimulated by light. In this respect, R. J. Lythgoe and K. 
Tansley (1929) have considered that the mechanisms of the dark 
adaptation process are due to the reorganization of the neural 
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connection which takes place during dark adaptation. S. Hecht 
(1928) has assumed that the various individual detectors are more 
active at higher than at lower illumination. 

The theories of R. J. Lythgoe and S. Hecht seem to give little 
direct evidence concerning the existence of anatomical changes in 
the retina. Therefore in considering the activity of the retina in 
the change from the photopic to the scotopic eye, we might expect 
that the anatomical structures of the retina remain perfectly un- 
changed and also we might assume that all the detectors are quite 
identical in all their properties. Assuming the above, only one 
problem would remain, that is, the existence of total color blind- 
ness and of simple congenital night-blindness, since the former 
represents an anomaly in photopic vision and the latter in scotopic 
vision. The existence of these two different kinds of anomalies 
of vision would actually provide evidence that the phenomena 
occurring in the photopic eye and in the scotopic eye might be due 
to different functional activities. If we were to assume that the 
functional activities in the scotopic eye are completely different 
from those in the photopic eye, that is, photopic vision taking 
place only in the cones of the cone system and scotopic vision 
only in the rods of the rod system, how would we then explain the 
fact that the process of adapting from light to dark seems to be a 
continually changing process. We were fortunate to be able to 
examine a thirteen-year-old boy with total color blindness and a 
seventeen-year-old boy with simple congenital night-blindness. 
Using fusion frequencies as an index of the area effect, we com- 
pared each of them with normal persons. The result of the obser- 
vations, using different states of adaptation, revealed that in 
simple congenital night-blindness the area effect was as large as 
that of the normal retina. However, total color blindness did not 
show any area effects. When we illuminated the surrounding field 
of the test patch with white and blue lights, the fusion frequencies 
of both simple congenital night-blindness and of normal persons 
increased at the peripheral retina. The fusion frequencies also 
increased with larger test patches and higher intensities of the 
surrounding illuminations. Total color blindness, on the contrary, 
showed entirely the opposite results. The fusion frequencies were 
reduced at the peripheral retina and decreased with larger test 
patches and higher intensities of surrounding illumination. 

From the results obtained we see that total color blindness and 
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simple congenital blindness are not caused by the absence or 
defective function of the cone system or the rod system, as has 
been assumed on the basis of the old concept of visual mecha- 
nisms. In accord with the above findings, the only hypothesis the 
author is able to offer is that the cases of color blindness and of 
night-blindness provide evidence that two opposite processes 
exist. One process might be designated as the facilitation process 
in the synaptic neural nets, and the other process as the inhibit- 
ing process which blocks the facilitation of the impulses in the 
nervous pathway. If the inhibiting process is impaired in some 
way, the result is total color blindness. On the other hand, if the 
facilitating process is impaired and if the light stimulus acting 
upon the retina is weak, the result is night-blindness. 

We are able to summarize this first section as follows. The 
functional activities of the photopic eye and the scotopic eye must 
be continuous, the cone systems and rod systems working together 
on both the scotopic eye and the photopic eye. The difference 
between the scotopic eye and the photopic eye depends only on 
the difference in the amounts of the photosensitive substance 
contained in each photoreceptor. The difference in the amount of 
photosensitive substance contained in each photoreceptor might 
account for the difference in the power of temporal summation and 
spatial summation between the photopic and the scotopic eye. 
The interactive effect between each receptor stimulated by light 
on the retina must somehow be dependent on the difference in the 
amount of photosensitive substance in each photoreceptor. This 
conclusion must enter into the formulation of the complete visual 
process in mathematical terms. There remains then the question 
why does the photopic eye have good discrimination power while 
the scotopic eye has good summation power for the light stimulus? 
This problem must be resolved in terms of the structure and func- 
tion of the neural network of the retina. 


Il. A Proposed Model of Visual Mechanisms Regarding the Photo- 
chemical Processes and Propagation of Nerve Impulses. Accord- 
ing to the photochemical scheme of Hecht, a photosensitive 
substance S is converted reversibly into the photoproducts P and 
A. Thus: 

light 


Sa Plt 
dark 
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If @ is taken to denote the total available concentration of the 
photosensitive substance and 2 the amount which is in the form 
of photoproducts, the following equation for the kinetics of the 
reaction may be derived: 

dx 

ae i (ee) — ke, 2", (3) 
where k, and k, are velocity constants for the light and dark re- 
action, respectively. 
During the dark, the reaction which takes place may be written as 


dx - 

at = =e ve (4) 
Adding to these mechanisms, Hecht has assumed a ‘‘secondary 
dark reaction,’’ in order to explain the process which causes 
stimulation of the nerve endings. According to his explanation, 
when the retina is exposed to the light, the photosensitive sub- 
stance S, originally formed from its two precursors (P and A— 
precurser and accessory), is changed back into them under the 
influence of the light. And one or both of the freshly formed 
precursor substances immediately serve to catalyze the trans- 
formation of an innocuous material L into a stimulation substance 
T. This occurs during the latent period. The velocity of the 
formation of the stimulation substance 7 is a linear function of the 
amount of precursor substances formed during the exposure period. 
When a sufficient amount of the stimulating substance T has been 
accumulated, it acts on the nerve endings and initiates the sensa- 
tion of the eye. 

The entire sensory process may therefore be written as 


Riga pA A PST 3 


in which the symbol ||P + A|| signifies the catalysis by one or 
both of the precursors (Hecht, 1919b). 

The concept of the ‘‘second dark reaction” was derived from the 
following experimental results. 

When Hecht measured the reaction time of the Ciona as the 
parameter showing the effect of light stimulus, in connection with 
the amount of photochemical products necessary to act as a stimu- 
lus in the sense organ, he was able to show this relationship: 


E = kIT, 
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where E denotes the effect of the light stimulus in a given stimu- 
lated area. On the other hand, he defined the reaction time as 
being composed of two parts: the first period was the exposure to 
the light, the second was the latent period during which the Czona 
need not be illuminated. From his experimental results he then 
introduced the linear relationship between the reciprocal of the 
latent period and the sensitization period, namely, 


1/t° = 9T + ¢, 


where ¢’ denotes the latent period, g and c being constants. From 
the similarity of these findings to a well-known group of photo- 
chemical reactions, classed by Weigert (Hecht, 1919b) as ‘catalytic 
photochemical reactions’? in which the catalyst remains after the 
action of light, Hecht proposed the photochemical process men- 
tioned above. 

However, in spite of his explanation about the ‘‘secondary dark 
reaction,’’ it is very difficult to understand the meaning of the new 
P +A (freshly formed precursors) which is the direct cause of 
stimulation. 

When we consider electro-neuro-physiological observations, 
visual sensations might be produced by the propagation of nerve 
impulses resulting from the effect of light stimulus. The experi- 
mental results observed by E. D. Adrian, R. Matthews (1927) and 
H. K. Hartline (1940) are given by the expression 


ITA=k. (5) 


This relation states that when some area A on the retina is stimu- 
lated with an intensity / of light for the duration of time 7, the 
frequency of the volleys of propagating nerve impulses observed 
on the optic nerve fiber is proportional to the total amount of light 
energy absorbed by the retina. From their experimental results, it 
is natural to assume that the effect of light acting on the visual 
system is due to nothing but the total amount of light energy 
absorbed by the retina. 

In general, the absorbed light, /,, is proportional to the con- 
centrations of the absorbing substance, namely, 


I, = ledS (6) 


where / denotes the intensity of the original light, I, the light 
energy absorbed by the absorbing substance, d a constant related 
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to the absorption rate, and ¢ a quantity which depends upon wave 
lengths used in the exciting beam. On the other hand, when we 
consider the retina each photoreceptor must be a unit absorber of 
light energy. If a stimulated area has n photoreceptors and the 
concentration of the photosensitive substance in each photo- 
receptor is denoted by AS,, the total energy absorbed by the 
stimulated area can be given by the equation 


E = TI, =ITnAS, ed. (7) 


If we could determine experimentally the functional relation be- 
tween the concentration AS, of the photosensitive substance in- 
cluded in each photoreceptor and the distance R, from the central 
fovea to each photoreceptor, namely, AS, =  {k,}, we would have 
the following expression for S,, the total amount of § in the 
area A, 


a =|[ otedhaedy (8) 
A 


E = ITS ,ed. (9) 


The integral in (8) is to be taken over the stimulated area A, with | 
the origin of the coordinate axes zx and y taken at the central 
fovea. Note that we have assumed central symmetry. 

Now it is natural to consider that E may produce propagation of 
nerve impulses which are actually observed as electrical oscil- 
latory changes. Thus, if we propose a model as shown in Figure 
3, the light energy is absorbed by the photosensitive substance, 
and the photochemical process acts as a battery and produces an 
electromotive force to the cell membrane which corresponds to the 
R.C. oscillation circuit. The R.C. oscillation circuit consists of 
a glow discharge tube connected in parallel to a condenser. The 
charging of the condenser continues until the voltage has become 
equal to the breakdown voltage, namely, the threshold value of the 
membrane. When the threshold is reached, the membrane is dis- 
charged, the voltage falls, and a nerve impulse is propagated. 
The membrane then is recharged and the whole process is repeated 


In this case the total amount of light energy ab- 


eriodically. 
P y Thus we 


sorbed E must correspond to the electromotive force. 
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have 


d 
Ru -44R pea ME 
C lie erie 
(10) 
1 
ae ne! 


ae Re ee eke 


where Rf denotes the resistance, C,, the capacity, and Q the 
charge of the membrane, respectively. Integrating equation (10) 
we have 


Q eee 
= -e (1-6 Kam | (11) 


* 


If V* denotes £ which shows that the value of the voltage of the 


m 
membrane increases to the threshold of the breakdown voltage at 
the time 7;7 is given by the equation 


E 
r= kh C, log— 


=F (12) 
where 7 is the period of oscillation of the relaxation oscillation. 
The frequency v of the volley of propagating nerve impulses is 
inversely proportional to the period of oscillation. 


CELL-MEMBRANE 


R.C. OSCILLATION 
CIRCUIT 


DARK $ PHOTOCHEMICAL 
PROCESS 


PHOTORECEPTOR 
FIGURE 3. A proposed model of visual mechanisms. 
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In ordinary circumstances, nerve impulses will usually propagate 
under the condition that E >> V*. Thus we have 


rai GV 7 Ry (13) 
so that since py = 1/r 
E=Kvy, (14) 


where K = RCV™ = constant. 

Now let us consider Hecht’s model. From his assumption, in 
the equilibrium state for a given intensity /, equation (3) may be 
written as 


dz : 

a 7 0 = h(a to) — a to + (15) 
it 

And when the eye is exposed to light of intensity I, + Al, the de- 

composed substance is increased up to 2, +Az. Thus we have 


dz 
a 7 Bil + AD(a- ty - Az) -k, (a + Az)?, (16) 


Substituting equation (15) into equation (16), we have for small x 


dz 
ee eet) =k pal8 (17) 


Integrating this we have, for short times, 


Azg= 2-2, =k, Alts,. (18) 


This Az is clearly equivalent to the so-called freshly formed 
precursor discussed by Hecht. We shall show subsequently that it 
will not be necessary to assume that the freshly formed precursor 
plays a special role in causing stimulation. 

Discussion. The usual method of study of dark adaptation con- 
sists essentially in exposing the subject to light of measured 
intensity (the so-called bleaching light) for a measured time (the 
so-called time of dark adaptation) and then determining the bright- 
ness of the light which will just barely produce readings of thresh- 
old values which are plotted against time in the dark. Here we 
have two different ways to show the observed dark adaptation 
process, which depend upon the definition of the threshold in- 
tensity. Namely, in one, the sensitivity of the eye is defined as 


358 SADAYOSHI KAMIYA 


the reciprocal of the threshold intensity; and in the other, it is 
defined as the logarithm of the threshold intensity. Both ways 
are usually used to show the dark adaptation process. However, 
according to S. Hecht (1919a) it has been confirmed experimentally 
that, when the intensity of the stimulating light is kept constant 
and its time of action varied, the photochemical effect E is a 
linear function of the time @. 


E = kt. (19) 


On the other hand, if the time of action is kept constant and the 
intensity varied, the photochemical activity of the light is found 
to be directly proportional to the logarithms of the intensity /. 
This may be expressed as E=klog/. Then if both the intensity 
and the time are permitted to vary, the photochemical effect should 
be proportional to the product of the time of action and the loga- 
rithm of the intensity. In other words, 


E = ktlogl. (20) 


Then Hecht finally assumed that the amount of photosensitive sub- 
stance in the dark adaptation process is proportional to the 
reciprocal of the logarithm of the threshold intensity. That ex- 
pression may be written as 


C 
log/ 


S, (21) 


where C is a constant which is related to the sensitivity of the 
eye. If we define the sensitivity of the eye in this way, we are 
able to calculate the amount of photosensitive substance which is 
contained in the photoreceptor at any given time in the dark by 
determining the threshold intensity which is easily measured 
objectively. 

Now let us try to verify the photochemical reaction which takes 
place in the photoreceptors. For this purpose it is most con- 
venient to observe Mizuo’s phenomenon which occurs in the case 
of Oguchi’s disease. The dark adaptation process in Oguchi’s 
disease is markedly delayed when compared with that in a normal 
person, and there is also a change in color of the retina. Inthe 
light adaptation state the retina is gold in color and returns to 
normal in the complete dark adaptation state. Y. Ehara (1936, 
1939b) made many important analytical observations about Mizuo’s 
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phenomenon. By using Hecht’s definition he calculated the photo- 
chemical process which takes place in the photoreceptor in 
Oguchi’s disease by changing the threshold intensity in the dark 
adaptation process. 
When / = 0 in equation (3), we find 
Ly -2 


Oe aa 2 
‘0 


where ¢ is the time in the dark and k, denotes the velocity con- 
stant of the reaction. The relative values of %, and 2 are ob- 
tained from observed threshold values of intensity. By using this 
equation, Ehara calculated that the velocity constant & has almost 
the same value in the many cases of Oguchi’s disease which were 
observed by different scholars, and that the dark adaptation process 
in Oguchi’s disease was explained in terms of the bimolecular 
chemical reaction already suggested by Hecht in the normal eye. 
However, in Oguchi’s disease the velocity constant of the reaction 
is equal to between 1/15-1/13 of that of the normal eye measured 
under the same conditions. From these results, it seems to be 
quite adequate to assume that the dark adaptation process takes 
place according to the bimolecular reaction. 

Lowever, Y. Ehara (1936) observed that, when the eye in 
Oguchi’s disease became adapted to a dim light for a long time 
and the light was turned off, the threshold intensity suddenly 
dropped (Figure 4). He could not explain this phenomenon by 


os 7a Ae ie 8 5 42 49-256 63 70). 77™ 8A On 98) ICS 
FIGURE 4- Dark adaptation process in Oguchi’s disease (observed by 
Y. Ehara, 1936). ‘The downward arrow shows the time that dimlight is 
turned off. The upward arrow shows the time that dim light is turned on 
The continuous curved line shows dark adaptation process without dim 
light adaption. 
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Hecht’s assumption. If Hecht’s assumption that the threshold 
intensity must correspond with the amount of photosensitive sub- 
stance in the photoreceptor is correct, and if the dark adaptation 
starts from the same threshold level, the dark adaptation process 
must proceed at the same rate. 

Similar results were obtained on the velocity of the dark adapta- 
tion process of the normal eye from various experiments carried 
out in the following way. First the eye was adapted to a high 
intensity of light. Next the eye was exposed to a dim light, in 
order to observe the dark adaptation process (Ehara, 1939c). 
From experimental results obtained in this way, Ehara concluded 
that the difficulty in explaining this visual phenomenon by using 
Hecht’s assumption lay in the fact that the determination of 
threshold intensity and the velocity of the adaptation process 
depended only on one factor. 

Therefore he proposed the following model 


light light 
— BE, (23) 
dark dark 


His model assumed the existence of an intermediate product in the 
chemical reaction between S and P+ A. Ee named this inter- 
mediate product E which was taken from the first letter of his 
last name. Ehara explains his model in the following way: 
(1) The factor E has an action which facilitates the dark adapta- 
tion of the rods. (2) With light, E changes to other substances 
which have no effect in promoting the dark adaptation. (3) The 
factor is photosensitive. When we consider the photochemical 
reaction, statement (3) seems inadequate. Furthermore the mathe- 
matical formulation of this model is difficult. However, it is 
important that Ehara pointed this out. 

After the second World War, very detailed experimental results 
about regeneration of visual purple were demonstrated by Wald 
(1935, 1947, 1951). Wald demonstrated that during dark adaptation 
there are two chemical reactions by means of which visual purple 
is regenerated (Figure 5). When visual purple (rhodopsin) is 
exposed to light, an intermediate transient orange is produced 
very quickly. This then is converted into visual yellow. This 
intermediate product between visual purple and visual yellow 
corresponds to E which was suggested by Ehara. 
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FIGURE 5 


When we consider only the process which takes place in the 
dark, regeneration from visual yellow is much more rapid than the 
reaction chain involving visual white. According to Wald, as the 
degree of light adaptation is increased, the amount of pigment in 
the form of visual white is increased. ‘Therefore, it is possible 
for the apparent regeneration curve subsequent to a small amount 
of light adaptation to be quite different in form from that subse- 
quent to a large amount. 

From this point of view, T. L. Jahn (1946) demonstrated a new 
model to explain the visual mechanism. He assumed that the 
regeneration from visual yellow to bimolecular and that the re- 
generation from visual white is autocatalytic. He also assumed 
that when we consider the regeneration process in the dark, the 
reaction from visual yellow to visual white can be omitted. 

Yowever, H. Tagami (1951) demonstrated that the time in the 
dark in which dark adaptation is completed is determined within 
wide limits by the total amount of the adapting light (intensity 
times the exposure time). The same results were obtained by 
C. Haig (1941). These results actually mean that the regeneration 
process (which is shown by the dark adaptation process) following 
a weak light stimulus is not essentially different from that follow- 
ing an intense light stimulus. 

Some basic points of Jahn’s theory, therefore, seem to be in- 
adequate. Thus we conclude that the reaction chain involving 
visual white does not essentially relate to the physiological 
process. Hence, when we consider the physiological process of 
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visual mechanisms, it is sufficient to consider only the cycle 
(visual purple—transient orange—visual yellow—visual purple) 
in order to explain the basic mechanisms. 


Lu. A New Model of the Visual Mechanisms and Application to 
Various Experimental Situations. In view of the above discussion, 
we shall consider in some detail the following model: 


Soe CAA eR 2 


The photochemical process in this model may be described by the 
following equations: 


dS 


Gp Th IS + ha E+ ky P*, (25) 
dE 

phy Seg hes (26) 
dP : 

ee es 7 (27) 
as. ak 6P 

dip dic de 5) 
S+E+Pea (29) 


The electrical process which results from the photochemical 
changes and which in turn produces the propagation of nerve im- 
pulses is described by the equations (10). When we consider 
each photoreceptor as a unit absorber of light energy, the total 
amount of photosensitive substance which is included in the area 
stimulated by light is again given by equation (8). 

We may see this entire process in Figure 6. This figure shows 
that when a photoreceptor is stimulated by light, it will produce 
an e.m.f. which corresponds to E, which in tum depends on the 
amount of total energy absorbed by the retina E. The e.m.f. will 
result in propagation of impulses in the retina through the random 
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FIGURE 6 


neural network and will then arrive at a ganglion cell which is 
called a ‘‘midget ganglion cell” by S. L. Polyak (1941). Each 
ganglion cell is connected to one optic nerve fiber. The membrane 
of each ganglion cell has the same threshold breakdown voltage. 
The frequency of nerve impulses propagated is proportional to the 
e.m.f. at the membrane of each photoreceptor. 

Connecting this model with the model of the visual process 
discussed in the first section of this paper, the area effect will 
be expressed by equation (8), and the interaction effect which is 
suggested by Yokose will be expressed by the electric flow in the 
retina. 

1. Light and dark adaptation processes. Figure (7) illustrates 
experimental results on dark adaptation. The resulting dark adap- 
tation was measured by Y. Ehara (1939c) when the eye was first 
adapted to the light / and then the light was turned off. We are 
able to calculate this case easily from the model. From the results 
of experimental observations on visual purple, the reaction from 
S to E is much faster than the reaction from P to S. Therefore 
when we consider the fast reaction of E with respect to the slow 
reaction which is related with the adaptation process, E will be in 
an equilibrium state in a very short time. Thus we have 


S+E,+P=a, (30) 


k,l 
—_—1___§ since E is an equilibrium state. Substitut- 


ae +k, 
ing FE into Eaetion (27) we have 


where Ey 


dP 
ee age aed (31) 
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ky 
k,l+k +k, 
light. Integrating equation (31) in the usual way for P = 0 when 
t=0, since light adaptation has started from the completely 

adapted state, we have 


where k& = 


constant for a given intensity of adapting 


_g7PT 
Piro le it ee (32) 
a saat 2 ee here w en bie 


4 ak ; 
where p=Kk, k= 1+ : , and 7 is the duration of light 


adaptation. P, denotes the concentration of P at T. On the 
other hand, in the dark the reaction of P is shown in the following 
way 


dP 
—=-k, pale (33) 
dt 

P 
P(t) = an 34 
(2) 1+k,P,t , soa 


where Py denotes the initial concentration of P at the time when 
dark adaptation has started, and P(t) denotes the concentration at 
the time of ¢ in the dark. 


Now let us attempt to consider the mechanisms determining the 
threshold intensity. 


In the usual way, the threshold sensitivity has been determined 
by the brightness of light A/ which just barely produces the sensa- 


BRaE™ 
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FIGURE 7. Dark adaptation process after various idee of inges adap- 
tation (after Y. Ehara, 1939), Ordinate, log threshold; abscissa, time 
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tion of vision. In this case, for the threshold value of E* from 
equation (26), we have 


Baan 
Miata om NN 35 
oe) Pa : ee) 
h(k, +k 
Lees 


If we denote A/™ as the value of A/ when T = 0, P, = 9, we have 


Peele — en’ + Omi(i —e0°*) 
AI” (R14) + (R+ 1)e767 + Qmt(1- PT) 


(37) 


where m = ak ,- 


When ¢=0, this equation shows the light adaptation process as 


Al (R+1)+(R- Leas. 


ayy elle ees Aa (38) 
Ale TYR Ae th 4 )es" 
When 7 —>» ~, ¢=0, as the initial value, we have 
Al 1 
OF i Pae (39) 
ALP fered 
: k r 
Now putting = I, we have 
4ak, 
AeetR IPRS 1) eee = eH RT) ny 


en) eee tye Ik acd — eIkT) 


where 7’= 4ak,7, and t’= 2ak,t, these times being proportional 
to the times in the light and in the dark, respectively. 

Thus if we give a value of /, we are able to calculate the whole 
process of light and dark adaptation. The result of the calculation 
when / = 1 is shown in Figure 8. 

2. Critical fusion frequency. When we want to discuss fusion 
frequency, we have to consider only the quick reaction, since ete 
flickering sensation usually takes place in a very short time 
compared with the adaptation process of the eye. Thus, we are 
able to assume that the amount of the photosensitive substance S 
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FIGURE 8- The curves represent equation (43) for/ = 1. 


is constant in a given adaptation state. When we use a rotating 
disc in which each period is 7, the exposure time is p?¢, and the 
latent period is (1 — p)T. 

From the Talbot-Plateau law, when the frequency of the rotating 
disc increases above the critical fusion frequency, the brightness 
sensation which is produced by the intermittent light is equivalent 
to the sensation which is produced by the continuous light of 
intensity p/. Thus from equation (26), the brightness sensation 
which is produced by a frequency just over the critical fusion 
frequency is obtained as 


k, Spl 
E (pl) = —--—_ = Gp, (41) 
ka +k, 
where 6 =k, S)/(k, +%,). E(pl) is the value which corresponds 
to the brightness sensation and is the average value of E over a 
cycle. The value of § is given from the relation 


k,k,plS=k,(k, +k,)(a - S)?. (42) 


Its value depends upon the intensity / times the fraction p. We 
may ignore the change of § or P with time but not E. Let Ey be 
the value of F at the moment the light comes on during any cycle. 
From equation (26) we may calculate the change of E with time 
and in particular the value of FE, the value at time pT when the 
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light goes off. Since we are interested in short times we may 
write the solution to (26) as 


E = E(pT)=E, [1 -(k, +k, pT +..-) +h, SpT+.... (48) 


But during the time (1 - p)T7 when the light is off, E decreases 
from E to a value which must be equal to Ey, that is, the value 
at the end of a cycle must be the same as that of the beginning of 
the next cycle. Hence 


—(katks) (1- 
Ee ~ | s) PTE, =E [1-(k, +k,)(1-p)T+..-]. (44) 


Then introducing AE=., —E,, & measure of the amplitude of 
the fluctuation, we have from (43) and (46) 


AE =E_ -E, =k,!Sp(1-p)T, (45) 


where S depends upon p/ according to (42). 

Denote by A the critical value of AE when T has the critical 
value 7*. The critical fusion frequency f is the reciprocal of f: 
so that we have 

1 


k,l 


This equation is similar to H. D. Landahl’s equation (Householder 
and Landahl, 1945). Note that S depends upon p/ according to 
(42) and hence depends upon the adaptation state. 

3. Effect of various forms of test patches. In order to evaluate 
the effect of various forms of test patches, we shall first calculate 
the fusion frequency for rectangular stimuli of constant area to 
determine how the frequency changes as the stimulus is deformed 
from a square into a long, narrow rectangle. Then we shall con- 
sider the effect of the eccentricity on the fusion frequency of 
elliptic test patches. We shall also consider how the fusion 
frequency will change when a constant circular test patch is 
moved from the fixation point to the peripheral retina. 

In order to make these calculations we need an expression for 
@{R.}. The experiments by T. Horiuchi (1951) are helpful in 
determining this relation. At first Horiuchi confirmed Ferry- 
Porter’s law by using circular test patches. He then varied the 
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radii of the patches and obtained the following results: 


f=alog 7k? +, (47) 


where RF is the radius of the test patch when it is measured with 
the center of the patch as the fixation point. Then the fusion 
frequency was measured using line test patches. The experi- 
mental results showed that the fusion frequencies can be expressed 
by the following equation 


f=2alog kc’, (48) 


where F is the length of the line test patch with one end of the 
line as a fixation point. 

From these facts we can assume that the concentration of the 
photosensitive substance in each receptor is equal to the recipro- 
cal of the distance from the central fovea to each photoreceptor. 
Thus we have 


AS, = d(R,) = ¢/R,, (49) 


where € is constant. Strictly speaking we should add a constant 
to the denominator of the right-hand side of (49) since the values 
must remain finite at 2; = 0. We shall, however, ignore this effect. 
Substituting this into equation (8), we have 


1 
Se ie ae (50) 
A ke, ; 
Then substituting S$ , from (50) for S in equation (46) we have 
I 
f= Cle p) [fp dady, (51) 
fe 


where ¢’=k,¢/h and where R, denotes the distance from the 
central fovea to a point (z,y). Note that K, is proportional to ¢d 
of expression (6). 

From this equation we can calculate the fusion frequencies for 
any form of test patch (A), using a rotating disc with a given dark 
and light ratio. Also, if we substitute the value of AS;, in the 
equilibrium adaptation state for a given intensity /, we are able to 
evaluate the fusion frequency. 
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Now we shall calculate the total effect of the light stimulus in 
three experimental cases. The calculations are of importance in 
confirming the ‘‘area effect.’’ 

a. The case when the test patch is rectangular. Using the rec- 
tangular coordinates, we denote the length along the x and y axes 


2 
as = and At respectively, and take the origin to be the fixation 


point, and let z,y, = constant. If we take the case in which the 
area is constant, it might be expected that the sensation remains 
the same. However, it is found experimentally that beginning with 
a square, as z becomes longer and y shorter, the resulting sensa- 
tion of the rectangular test patch reduces almost proportionately. 
This experimental result can be explained by the integration of the 
equation (51) as follows: 

Using the polar coordinates R, and 6, we can express equation 
(51) as follows: 


S, = Cffddk, (52) 
5 ; v, Y;, 
The corner of the rectangle is the point ea ie and @ has a 


value 6, such that tan 0, = y,/2,. 
Integrating over A; from zero to R, = tay sec 6 when @ < @,, and 
from 0 to RF, = 5acsc 6 for 6 > 6,, we find 


7 
A, x ae) | 
S, =4 __ do *_ d6}. 53 
cs lf 2cos 6 eh 2 sin 6 Se 
1 
Integrating this equation and letting p = y, /a,, we have 


S,=2¢a,[log(y1+ p? - p) + plogp — plog(/1 + p? - 1)]. (54) 


b. The case when the test patch is elliptic. Suppose the form of 
the test patch is changed from a circle to an ellipse with the same 
area. In this case, if a@ and 5 are the major and minor axes, 


2 2 
~+5-1 (55) 
a b 
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Introducing polar coordinates, (55) becomes 
b 


| ee rea 
‘  J/1 = €2 cos? 6 ; (56) 
where ¢ is the eccentricity of the ellipse, e? = 1- 5?/a*. Hence 
from (52) and (56) we find 


2 2 : 
sy 200d {t+ (=) 7 + a e* + a rer (57) 


e being the eccentricity of the ellipse. It a= 6, «= 0, therefore 


S, = 27) = 27k, FR being the radius of the circle. Since we wish to 


keep the area of the ellipse constant, we have the additional relation 


7ab = constant = 7R?. 


3. The case when the center of a circular test patch is not at the 
fixation point. If we denote R, as the distance from the fixation 
point to a point (a, y), FR. is expressed by the equation 


R, = Vr? - 2rr’ cos 6477, (58) 


where r’is the distance from the fixation point to the center of the 
test patch, r is the distance from the center of the test patch to a 
point (z, y), and @ is the angle between 7 and r’. And in the case 
when the test patch includes the fixation point, namely, the case 


if 
of r>r’, 7 is given by the equation 
i 


1 co an 
re Me aad P (cos 6). (59) 
OS 


The case when the test patch excludes the fixation point, namely, 


1 
the case of r< r, ry is given by the equation 
i 


Go n 


1 r 
as = >: Bast P. (cos 6), (60) 


i n=0 
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Mhere P, (cos @) is a Legendre polynomial. Thus if PR is the ra- 
dius of the test patch and if R < 1’, we obtain 


5 eke: , 
0 0 R, 83) 


aR? lefle\ Sard akR\* Oaasic/ R\o 
a= € —s1+ —(—} + —(— = 
s r { s (2) Cy) 1 0ns ese = 
In ‘2 R, the contribution to S, from the region r< r’ is given by 
(62) with r= ?°, which we denote by S,(?’). The additional contri- 
bution from that part of the test patch for which >’ is obtained 


by introducing (59) into (61) but using r’ instead of zero as the 
lower limit of integration. In this way we obtain, for r’ > R, 


pee (r\ bare 
Sey ig A? 6 Ry Fd ns eee peerings eee 
ea) { : (5) 64 (5) 256 (=) oes. 


1 3 5 

pat are 556 | 
These examples can be easily confirmed experimentally and also 
these three cases are very interesting from the neurophysiological 
and the psychological point of view. In a subsequent paper the 
relation and agreement between the theoretical results, which are 
presented in this paper, and the experimental results will be dis- 
cussed in detail. 

4. The problem of visual acuity. In this case, as it is shown in 
Figure 6, we have to consider the electric conduction through the 
random network of the retina. Also in the ordinary case when we 
discuss visual acuity or the ability to distinguish between two 
stimulated points, we have to consider the problems which take 
place in a small area of the surface of the retina. Thus, we may 
assume, that in a small area each photoreceptor has the same 
amount of photosensitive substance. In this case it is natural to 
assume that conduction will take place analogous to the phenome- 
non of the heat conduction in three dimensions. 

If this assumption is true, we are able to resolve this problem 
_ easily. However, we would like to discuss this problem in detail 
in the next paper where we will consider the fine-microscopic 
structure of the retina in connection with the minimum threshold 


(63) 
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quantum number necessary to produce visual sensation. We are 
now determining this minimum threshold quantum number with the 
cooperation of S. Yamamoto, a mathematical statistician, and the 
members of the Department of Ophthalmology at Nara Medical 
College, Japan, using a new method which is different from the 
method previously reported (Kamiya and Yamamoto, 1953). 


The author wishes to thank Professor N. Rashevsky for the 
opportunity of studying mathematical biology for the past two 
years with the Committee on Mathematical Biology, The University 
of Chicago. The author is also indebted to Professor H. D. Landahl 
for many stimulating and enlightening discussions. 


One of the most promising young Japanese professors in the 
field of research concerning the dark adaptation process of vision 
was the late Dr. Yukichi Ehara. Dr. Ehara had recorded his plans 
for future research in the field of the physiology of vision, and the 
author was given his notebooks by Dr. Hirotoshi Tagami, who had 
been working under the late Professor Ehara. Deciding to con- 
tinue Ehara’s research the author spend the past ten years study- 
ing exclusively the physiology of vision with Drs. H. Tagami, 
Tetsuya Horiuchi, Mutsuo Yamagishi, Akira Momose, Masanori 
Iwagaki, and others. The author was especially helped by the 
mathematical statistician, Mr. Sumiyasu Yamamoto, with regard to 
the subject of quantum biophysics of vision. 
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Chicago 37, Illinois. 
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